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EDITORIAL 


His Supplement to the second number of 

Volume 47 stands as another memorial to 
Dr. Morton Prince, the founder of this JourNAL. 
When he gave the Journat to the American Psycho- 
logical Association in 1925, Dr. Prince included in 
the deed of gift the provision “that any profit that 
may accrue after paying all legitimate expenses 
shall be used solely for the benefit of the JourNaL 
and its subscribers” (2; 3, p. 3). This Supplement 
At the 
annual meeting of 1951 in Chicago, “the Council 
of the Association voted to spend up to $7,000 of 
accumulated ‘Morton Prince’ fund to print addi- 
tional issues in order to reduce publication lag” (1, 


is a gift to subscribers from such funds. 


p- 559). 

This Memorial Supplement was planned as a 
benefit to interests. Publication lag has 
been increasing. With the marked increase in 
research productivity that has occurred in psychol- 
ogy since the end of World War II, all of the 
journals published by the American Psychological 
Association have tended to accumulate backlogs of 
accepted manuscripts in spite of the fact that the 
journals have been increased considerably in size. 
This Supplement was planned to decrease to some 
extent the backlogs for as many as possible of these 
journals. Each of the Association’s editors has 
been invited to contribute from his backlog of 
accepted manuscripts those which would fall topi- 
cally within the limits of abnormal and _ social 
psychology as defined by the editor of this Jour- 
NAL (3, p. 5). Although a major portion of the 
papers published herein have come from the backlog 
of this JouRNAL, 13 were contributed by the Journal 
of Consulting Psychology, three by the Journal of 
Applied Psychology, and one by the Journal of 
Experimental Psychology. To this extent the plan 
has succeeded in benefitting the contributors to all 
of these journals. To the subscribers of the Jour- 
NAL, this Memorial Supplement is a gift equivalent 
to somewhat more than half of the customary 
annual volume. 

The publication of this Supplement will have 
reduced the typical lag of publication in this 
JournaL from what had become approximately 15 
months to about 9 Again the editor 
wishes to express his intent “to accept manuscripts 
in such a proportion that the lag can be kept under 
a year” (3, p. 4). With his experience of the past 
two years as editor, perhaps he can succeed better 
than he has in the past. In this sense, this Supple- 
ment is a monument to his failure to reject a 
high proportion of the manuscripts 


several 


months. 


sufficiently 
received. 


Maintaining an optimal lag of nine months is no 
easy task for an editor. The allotment of 544 pages 
a volume, with approximately 50 pages devoted to 
book reviews, permits the publication of approxi- 
mately 75 original articles! In 1951, for instance, 
the JourNaL received 290 manuscripts. Only 80 of 


these, in the editor’s judgment, were of a kind from 
which the professional psychological public should 
be protected to save the time of reading and biblio- 


graphical effort. This left 210 deserving of publi- 
cation although considerably more than half would 
and/or did require radical revision or condensation. 
Seventy-six were accepted. Thus, 134 manuscripts 
(approximately 45 per cent) had to be rejected even 
though the editor and/or his associates judged that 
the science would be the loser should they not be 
published. Such rejections are difficult whether the 
editor be identified with either the author or the 
reader, and even though he be identified generally 
with the long-term good of his field. 

Fortunately, there are in the topical areas served 
by this JourNAL a good many other journals where 
the lag in publication is often even shorter than 
nine months. Authors can well turn to these 
journals as outlets for their work. Moreover, the 
fact that these journals exist makes it questionable 
as to whether the Association should allot more 
pages to this Journa. or others like it. What 
proportion of the scientific output in psychology 
should be published in the Association’s own jour- 
nals? Is not anything approaching a monopoly 
likely to be unfortunate even though the agent of 
monopoly be a scientific-professional association? 
These are difficult policy questions which fall to 
the Association’s Publications Board for study and 
recommendation. 


1 Manuscripts given “prior publication” at the expense 
of authors should be added to this number. The pages 
devoted to such papers do not use from the page allot- 
ment of the JourRNaL. Morcover, they go as a kind of 
bonus to the subscriber. 

The fact that a good many manuscripts which would 
be judged acceptable must be rejected in order to main- 
tain an optimal publication lag complicates policy with 
respect to manuscripts submitted with requests for 
“prior publication.” In practice, the policy questions 
have been solved by the editor's referring each such 
manuscript to one of his associates who is asked to make 
a judgment concerning its acceptability without knowl- 
edge that the request for “prior publication” has been 
made. 

2 This statement is based partly on the report (1950) 
of the Committee on Publication Outlets in the Area 
of Clinical Psychology which was chaired by Robert R. 
Holt for the Division of Clinical and Abnormal Psy- 
chology. It is also based partly on recent, personal com- 
munication with some of the editors concerned. 
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In the meantime, this Supplement, another me- 
morial to Morton Prince, permits at least the tem- 
porary reduction of publication lag in the topical 
areas of this Journat to the optimum of approxi- 
mately nine months. 

J. McV. Hunt 
April, 1952 


J. McV Hunt 
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A THEORETICAL ANALYSIS OF THE RELATIONS BETWEEN 
SOCIOLOGY AND PSYCHOLOGY * 


EDWARD C. TOLMAN 
Berkeley, California 


HE following remarks are offered with 
"T apotosis My own research has been 

almost entirely on the nonsocial be- 
havior of nonsocial animals—namely, rats. 
My excuse for presenting the following is 
that I have spent some time, recently, helping 
to produce a cooperative statement of the 
theoretical relationships between sociology, 
anthropology, and psychology (14). 

My main thesis will be that our difficulties, 
when we sociologists and psychologists 
attempt to cooperate, will be lessened if we 
have a clearer methodological scheme as to 
the variables and the causal sequences which 
we respectively wish to study. This paper 


will present my notions as to such a scheme. 

My first point will consist in the assertion 
that for both sociology and psychology, the 
threefold division into independent variables, 
intervening variables, and dependent vari- 


ables provides an essential and useful form 
of analysis. Let us consider this division 
first for the case of psychology. Although 
practically all psychologists today seem con- 
vinced of the value of such a trichotomy, it 
must be admitted that we psychologists still 
disagree somewhat in our conceptions as to 
the natures and definitions of each of the 
three classes of variable. What I shall do 
here, then, is to argue for my own definitions. 

First, the independent variables. These, as 
I see it, are: environmental stimulus situ- 
ations, plus physiologically defined drive 
states, plus constitutional factors (such as 
heredity, age, sex, health). 

Second, the dependent variables. These, 
most psychologists today agree, are “be- 
haviors.” We differ, however, as to how 
behaviors shall be defined. Some talk in 
terms of muscle contractions and gland secre- 


1 Delivered with some modification as part of a sym- 
posium on “Cooperative Research Programs Involving 
Sociologists and Psychologists,” American Psychological 
Association and the American Sociological Society, 
Chicago, September 5, 1951. 
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tions. Others, like myself, talk, rather, in 
terms of actual or attempted organism-en- 
vironment rearrangements and achievements. 
For me, examples of behavior units would be 
a rat running to food in a maze; a cat getting 
out of a puzzle box; a man driving home to 
dinner; a child hiding from a stranger; a 
woman (or a man) spreading gossip over the 
phone;. a student exhibiting leadership in a 
group dynamics experiment; and the like. 
These are the types of behavior unit which I 
would seek to predict and explain. 

Finally, the intervening variables. Here 
the different psychological schools differ even 
more markedly. Gestalt psychologists talk 
either of perceptual gestalten or of gestalt-like 
processes in the brain. The Hull stimulus- 
response school (8) talks of stimulus-response 
connections (“habs”), reaction potentials, 
internal drive stimuli, anticipatory muscle 
contractions, resultant kinaesthetic stimuli, 
etc. I, on the other hand, talk about “be- 
havior-dispositions”—i.e., “needs,” “values,” 
“beliefs,” “differentiations,” “generalizations,” 
“perceptions,” “expectations,” “need-pushes,” 
“valences,” and “behavior-field forces” (14). 
Each of these variables I would conceive as 
a type of “behavior disposition” which can 
eventually be operationally singled out and 
measured in some special standardized situ- 
ation in which this disposition can be tied 
empirically to such and such independent 
variables by such and such a function or 
functions. 

To sum up, then, my notion as to the 
variables of psychology is indicated in the 
upper part of Table 1. But for the sake of 
mnemonic simplicity I shall abbreviate these 
as Stims; Values, cognitions, and behavior- 
field forces; and Acts. See the lower part of 
Table 1. 

Turn now to anthropology and sociology. 
I know but little of these disciplines. Yet I 
shall be bold and assert (see the upper part 
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TABLE | 


A CLASSIFICATION OF THE VARIABLES WITH WHICH PsycHoLtocy Deats 








Types oF VARIABLES 





INDEPENDENT 


INTERVENING 


DEPENDENT 





Constitutional factors 


age, sex, etc. 


Physiological (e.g., 
those involved in hunger, sex, 
etc.). 


Physical and stimulus fac- 
tors (i.c., bricks, stones, foods, 
animate beings, etc.). 


(heredity, 


tions, 
1 


crive’ states 


social 


Stims Values 


Ce gnitions 


Needs, beliefs, values, differenti- 

ations, generalizations, 
valences, 
behavior-field forces. 





Behavioral achievements 
tempted achievements. 


or at- 
percep- 
need-pushes, 





Behavior-field forces 


of Table 2) that anthropology and sociology 
taken together assume, or should assume, as 
their independent variables such factors as: 
climate, geography, history, physical and cul- 
tural resources, other human groups in the 
surroundings plus special personality char- 
acteristics or special psychological processes 


in a member or members of the given group. 
And as their intervening variables these 
sciences should assume: the social structures 
of the group (i.., its political, economic, 
religious, industrial, technological, recrea- 
tional, etc., institutions and organizations) ; 


plus the group’s corresponding ideologies and 
value systems; plus, finally, the resultant 
statuses and roles and role expectations which 
the group by virtue of these institutional 
structures and these organizations imposes 
upon its members. Finally, I shall assert that 
as their dependent variables, sociology and 
anthropology should take the resultant role 
performances of the individual members of 
the group plus what may be called the col- 
lective group productions (whether material, 
immaterial, or symbolic). 

And once again for mnemonic simplicity 


TABLE 2 


A CLASSIFICATION OF THE VARIABLES WITH WHICH SoctoLocy Deats 


Types oF VARIABLES 





INDEPENDENT 


INTERVENING 


DEPENDENT 





Climate 

geography 

history 

resources 

other groups 

Special personality characteristics 

or special psychological proc- 
esses in group members 


Social structures 
organizations 
institutions 
ideologies 

value systems 
statuses 

roles and 
role-expectations 





Individual role performances 
Group productions 





“Surrounds” 
Personalities and 
processes 


psychological 
Roles 





Structures 
Ideologies 


Individual role performances 

Group productions (with resultant 
continuation, modification or 
breakdown of the given social 
structure) 











THEORETICAL RELATIONS BETWEEN SocloLocy AND PsycHOLOGY 


I shall abbreviate as shown in the lower part 
of Table 2. 

Combining now into a single table, my total 
schema is given in Table 3. And in Table 4, 
the corresponding types of theoretical and 
research problems are indicated. 

Let us examine the items in Table 4, read- 
ing from the top down. 

Sociological (or anthropological) theory is 
concerned, I assert, with specifications and 
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definitions of the variables (A), (B), and 
(C), especially (B). Such theorizing is pri- 
marily occupied, that is, with questions as to 
the basic forms of social structure, the nature 
of institutions, organizations, groups, the 
nature of ideologies and value systems, and 
the nature of roles and role expectations. 
Social etiology is the term I would use to 
to cover the study of the causal relation indi- 
cated by the arrow (A) to (B). How are 


TABLE 3 


THE INTERRELATIONS AMONG PsYCHOLOGICAL AND SOCIOLOGICAL VARIABLES 








Types oF VARIABLES 





INDEPENDENT 


INTERVENING DEPENDENT 





(A) 
Sociology Surrounds 
Anthropology Personalities 


psychological processes 


(D) 


Psychology Stims 





Values and cognitions 
behavior-field forces 


(B) (C) 

Role performances 

Group productions (with re- 
sultant continuation, modi- 
fication or breakdown of 
the given social structure) 


Structures 
ideologies 
roles 


(E) (F) 

Acts (with resultant successes 
or failures re the individ- 
ual’s goals) 








TABLE 4 


A List oF THE Types oF THEORETICAL AND RESEARCH 
ProBLEMs THAT RESULT FROM COMBINING Psy- 
CHOLOGICAL AND SOCIOLOGICAL VARIABLES 





Sociological (anthropological) theory concerns the defin- 
ing characters of (A), (B), & (C), especially of (B).* 


Social etiology investigates the causal relationships 
(A)>(B). é 

Social functioning investigates the causal relationships 
(B)->(C). 

Psychological theory concerns the defining characters of 
(D), (E), and (F), especially of (E). 

Psychological (personality) etiology 
causal relationships (D)->(E). 

Psychological (personality) functioning investigates the 
causal relationships (E)->(F). 

General social science theory considers the congruence 
of the two sets of constructs—especially those of (B) 
with those of (E). 

Social psychology investigates (B)->(D)->(E) or, in 
less abbreviated form, the longer sequence (A)->(B)—> 
(C)—(D)-(E)-(F). 

Psychological sociology investigates (E)—>(A)->(B) or, in 
less abbreviated form, the longer sequence (D)—>(E)—> 
(F)->(A)—(B)-(C). 


investigates the 





® These letters refer to the classes of variables as indi- 
cated in Table 3. 


social structures, ideologies, and roles deter- 
mined by variations in the history, geography, 
climate, resources, etc. (i.c., the surrounds) 
of the given group or by the unique person- 
alities or unique psychological processes of 
one or more members of the group? Con- 
sider some examples of investigations in this 
area.” One of the neatest studies, we all 
know, in which a change in the physical 
“surrounds” was observed to have brought 
with it a special resultant change in social 
structures, is that of Linton (12) on the 
Tanala and Betsileo of Madagascar. There, 
a change in a group’s locale with an accom- 
panying change from dry-rice culture to 
wet-rice culture produced a major shift in 
institutions, ideologies, roles. Generally 
speaking, however, studies in social etiology 
of this sort are difficult to find since the 


2 When it comes to presenting examples in the areas 
of sociology or anthropology, I must apologize. My 
knowledge of the relevant literatures is painfully meager. 
I shall no doubt pull some boners both of commission 
and omission. I can merely hope that my thesis will 
be somewhat clarified by my attempt. 








294 


resultant changes in social structure usually 
take generations to become complete. Some 
historians have, of course, attempted to re- 
construct some of the very general laws of 
social etiology and one of the most ambitious 
attempts is, of course, Toynbee’s study of 
history (15). But how valid such wide-flung 
historical accounts and interpretations may 
be I for one.do not know. 

Finally, however, other studies of social 
etiology, invoiving shorter time-spans and 
controlled manipulation of the (A) variables, 
would appear to be the action research 
studies of Lewin and his students (11) or 
those of the Tavistock Institute (see for 
example [g9]). In all such studies controlled 
changes in the surrounds or in the member 
personalities or in basic psychological experi- 
ences of some of the members are introduced 
to the group and the resultant changes in 
group structure, in ideology, and in roles are 
observed. And some general laws are, I 
believe, beginning to emerge. 

Turn now to social functioning, indicated 
by the arrow (B) to (C). Studies of this 


causal sequence would be ones concerned 
primarily with the question of how various 


combinations of institutions, ideologies, and 
roles interact and produce the final role 
performances or group productions which 
they do. 

A simple type of investigation in this area 
would be, I believe, that of Allport (2) on 
the J-curve. This is a study of the relation 
between the strength of institutional pre- 
scriptions and the resultant distributions of 
degrees of role conformance. 

Before passing on, it should now be pointed 
out, parenthetically, that investigations of the 
relationship (B)—>(C) can only be partially 
(i.c., only “analytically”) separated from 
studies of the (A)->(B) relationship. For, 
in order to produce the desired changes in 
the (B) variables for the purpose of studying 
their effects upon the (C)’s, one actually has 
to vary the (A)’s. And, similarly in study- 
ing the (A)-—>(B) relationships, the resultant 
(B)’s have to be identified by the (C)’s 
which they produce. The way out of these 
difficulties seems to be that when we are 
interested primarily in the (A)—>(B) rela- 
tionship we try to make the conditions such 
that the observed (C)’s can be assumed to 
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be directly correlated with the (B)’s, so that 
we can then take these (C)’s as direct, almost 
100 per cent indicators of the (B)’s. And 
in complementary fashion when we are in- 
terested primarily in the (B)—>(C) relation- 
ships we try to make the conditions such that 
the (A)’s, which we manipulate, can be 
assumed to be direct, 100 per cent correlates 
of the (B)’s, whose effects upon the (C)’s 
we are prirnarily interested in investigating. 

Turn to the next item. 

Psychological theory like sociological theory 
is concerned with specifications and defini- 
tions of the variables, in this case, the (D), 
(E), and (F) variables, especially the (E)’s. 
I have already presented above my own 
notions of these three classes of variable. 

Typical researches under the next heading, 
psychological etiology, i.e., the arrow (D)—> 
(E), would be: studies in developmental 
psychology, i.e., the laws of the growth and 
development of the personality as a result of 
different early constitutional and environ- 
mental stims; and also the classical investiga- 
tions of the generalized laws of perception, 
motivation and learning as a result of such 
and such typical stims or repetition of stims. 
And here, as in the situation for sociology, 
it is obvious that studies of these (D)—>(E) 
relations can be made only under conditions 
in which it can be assumed that the (F)’s 
vary consistently with, and mirror directly, 
the variations in the (E)’s. 

Consider next investigations under psycho- 
logical (personality) functioning represented 
by the arrow (E)—>(F). Here the (E)’s 
(the beliefs, values, and perceptions) would 
be varied or more often combined in pre- 
scribed new ways and the resultant variations 
in the final behaviors observed. Studies of 
the defense mechanisms would fall under 
this head. Important names in this field are, 
of course, those of Freud, Murray, and 
Lewin. All three have, it seems to me, been 
frequently (though not exclusively) con- 
cerned with these (E)—>(F) relationships. 
Finally, I also believe that the so-called “new- 
look” studies in perception (see, for example 
[4]), in which variations of need, belief, and 
value are shown to affect perception and the 
resultant responses, likewise fall here. 

Turn now, at last, to our main topic—that 
of the joint problems of the sociologist and 
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the psychologist. These I have called, respec- 
tively, General social science theory, Social 
psychology, and Psychological sociology. 

General social science theory. This in- 
volves first the development of constructs for 
sociology which are congruous with those 
developed for psychology and vice versa. 
And here I would put in a plug for the book 
by Parsons, Shils, et al—Toward A General 
Theory of Action—about to be issued by the 
Harvard University Press (14) in which 
Parsons and others (including myself) have 
attempted to make more explicit our defini- 
tions of the intervening variables for sociol- 
ogy and anthropology on the one hand, and 
for psychology on the other, in a way to 
make the two sets of constructs more con- 
gruent with one another. Second, general 
social science theory also involves considera- 
tion of the types of causal interaction to be 
assumed between the two sciences. This 
latter question I am answering here by intro- 
ducing the two rubrics of social psychology 
and psychological sociology. To these let me 
now turn. 

First, Social psychology. ‘This hybrid 
science is concerned, I would argue, primarily 
with relationships which may be indicated 
as (B)—>(D)->(E), where these letters refer 
to the classes of variables indicated in Table 3. 
It is to be noted, however, that the relation- 
ship (B)—>(D)->(E) is really an abbrevi- 
ation of the more extended relations (A)—> 
(B)—(C)-(D)->(E) > (F) (again, see 
Table 2). 

As typical studies in social psychology we 
might cite: first, studies of the effects of 
different cultures (B)’s on resultant person- 
ality structures, (E)’s. Such studies say as 
those largely initiated by Benedict’s Patterns 
of Culture (3) but which now have become 
very numerous. 

Second, “opinion survey” and “polling” 
studies—where the opinions of an individual 
are shown to be related to the social, eco- 
nomic, and political backgrounds (i.e., the 
(B)’s) of the group to which he belongs 
(see for example [10]). 

Third, attitude and prejudice studies—for 
example, the California studies of the “au- 
thoritarian” and democratic personalities (1) 
where the emphasis is upon an analysis of 
the difference in personality types which 


result from different subcultures and from 
different family training procedures. 

Fourth, many of the studies of Stouffer and 
his co-workers reported in The American 
Soldier (13) where again the effects of a new 
subculture (i.e., the Army) upon the indi- 
viduals’ personalities and psychological proc- 
esses, the (E)’s, were investigated. 

Turn next to Psychological sociology. 
Here fall studies in which the prime concern 
is the causal sequence (E)—>(A)-—>(B) as an 
abbreviation for the longer sequence—(D)—> 
(E)—(F)—(A)-(B)—(C), where these 
letters refer to the classes of variables indi- 
cated in Table 3. They are studies in which 
unique personalities’ (E)’s or unique indi- 
vidual psychological processes, also (E)’s, 
would be varied in order to produce resultant 
variations in the social structure, ideology, 
and roles of given social system (B)’s. 

Examples of such relationships are never 
too neat. It would seem to me, however, 
that Erikson’s study (6) of Hitler’s. psycho- 
logical make-up (i.e., Hitler’s (E)’s) and its 
reaction back upon the institutional, ideo- 
logical, and role set-ups of the German 
people might be a good example. Also, I 
believe studies on the effects which, say, 
acculturated individuals from primitive so- 
cieties do or do not have upon their native 
tribes, when they return, would also fall here 
(see also [6]). Further, it would seem that 
the historians who believe in the “Great 
Man” theory of history have made studies or 
at least statements which, if valid, ought also 
to be included. Finally, there would also 
come here, I believe, studies such as those of 
mob formation and mob action—i.e., studies 
of how the distorted beliefs and perceptions 
of single individuals can result in wholly 
new, though perhaps temporary, social struc- 
tures and ideologies (5). 

Finally, I would like to point to the Har- 
vard Ramah Project as a marvelous source 
for studies falling into all of the nine cate- 
gories. ‘There are, as is well known, five 
different groups—Navahos, Zunis, Spanish- 
Americans, Mormons, and Texans—all living 
in the vicinity of Ramah, New Mexico, and 
interacting with the environment and with 
each other. Here sociology, anthropology, 
and psychology meet in a big way; and, in 
fact, a recent summary (7) of the studies 
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going on at Ramah indicates that investiga- 
tions have been made which fit rather neatly 
into each of the nine categories in Table 4. 

First, Sociological (anthropological) theory. 
Florence Kluckhohn, Fred Strodtbeck, and 
J. M. Roberts have been led to develop a new 
set of constructs (B)’s for designating ideolo- 
gies and value systems which makes the 
comparisons between the five cultures more 
intelligible. These constructs as to belief- 


value complexes they list as: 


1. Beliefs concerning the innate predisposition 
of man. He is good, evil, neither good nor evil. 

2. Beliefs as to man’s relation to nature. (Man 
is subjected to Nature, Man is in Nature, Man has 
rational mastery over Nature.) 

3. Values as to time. (The emphasis may be on 
Past, Present or Future.) 

4. Values as to personality types. 
in becoming or Doing.) 

5. Beliefs concerning the dominant relations of 
man to other men. (Lineal, Collateral, Individual.) 


(Being, Being 


Thus, when it came to considering these 
five different cultures and their general atti- 
tudes towards man and the universe, these 
investigators were forced to develop this new 
set of sociological anthropological intervening 
variables. 

Social etiology. An example of a study at 
Ramah coming under this head could be that 
of E. Z. Vogt and Guy Pauker. They 
hypothesized that “if certain values of one 
culture are valued by another culture, the 
second culture will more easily accept other 
values originating in the first culture.” Evi- 
dence for this hypothesis they found in a 
study of the differential participation in 
American democracy of the Navahos and the 
Zunis. 

Social functioning. An example of a study 
of this (B)—>(C) relationship is that of E. Z. 
Vogt and M. S. Edmonson on humor and 
laughter. They find that laughter occurs 
repeatedly in the same contexts and plays 
upon the same themes. Each culture has its 
own characteristic inconsistencies or strains 
in its ideologies and value systems and such 
strains or (B)’s express themselves in cer- 
tain corresponding culturally-typed humor 
and laughter, that is, in certain character- 
istic (C)’s. 

Turn to psychology. 

*sychological or personality theory. The 
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Ramah Project will also provide a beautiful 
locus in which to test the adequacy for dif- 
ferent cultures of the (E) constructs mostly 
developed from studies in our own culture, 
but which we psychologists assume to hold 
for all. And I shall now assert dogmatically 
that I am sure that my (E) constructs of 
needs, beliefs, values, generalizations, percep- 
tions, need-pushes, valences, and behavior- 
field forces will be found equally valid and 
useful in studying individuals in each of the 
five cultures. 

Psychological or personality etiology, i., 
(D)—(E). It seems obvious that we would 
expect to find the same Jaws of learning, per- 
ception, memory, problem-solving, and _per- 
sonality development in all five cultures. 
Although the laws of learning, perception, 
memory, problem-solving, and_ personality 
development are already well established in 
our culture, in order really to be sure that we 
have arrived at the universal laws we need 
some such set-up as that at Ramah, in which 
the relevant (A)’s for learning, perception, 
etc., or for personality development are given 
different values in the different cultures. 
Take, for example, the case of personality 
development. It has been found by Sears, 
Whiting, Faigin, Hollenberg, and Sperry 
that where one parent is more important as 
caretaker and disciplinarian, children of the 
same sex as the important parent will show 
more guilt than the opposite sex children. 
And this seems to be a universal law as far 
as they have yet tested it. Thus, to quote: 
“This conclusion, reached from carefully 
matched studies made on Texan, Mormon, 
and Zuni children (cultures with varying 
sex dominance), if verified on further cul- 
tures, should be important for the general 
problem of specifying the determinants of the 
internalization of values and related proc- 
esses in the development of personality.” 

Psychological or personality functioning 
(E)-(F) can also be studied in Ramah in 
so far as the different cultures present differ- 
ent types of conflict (i.e., (E) combinations) 
to the individual and should, therefore, lead 
to characteristically different types of defense 
mechanisms. 

Turn, now, finally to the last three items 
in Table 4—which are of particular interest 
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for our joint research. First, the problems 
of general social science theory. Ramah, I 
am sure, would be the place to try out the 
applicability and usefulness of the constructs 
in the already-referred-to book, Toward A 
General Theory of Action by Parsons et 
al.(14). Parenthetically, however, I may 
remark that, never having been at Ramah, 
I am not at the moment too clear as to just 
how the theoretical analyses presented in the 
book would actually be applied. That is, I 
cannot now envisage just what the manner 
and the to-be-expected results of such an 
attempted application would be. I am sure, 
however, that such an attempt would prove 
an exciting enterprise. 

Now for social psychology—the sorts of 
causal determination involved in the sequence 
(B)—(D)-(E). An investigation which 
appears to be immediately along this line is 
that of J. M. Roberts and Mary Lou Bensley. 
They report that “differences in social or- 
ganization seem to be reflected in the fantasy 
productions of children. This is illustrated 
by comparing protocols of Mormon and 
Texan children when given the Thematic 
Apperception Test. For example, Mormon 
children, who live in a culture of intragroup 


solidarity, look for help when they need it 
from anyone who’s around; Texan children, 
who live in an individualistic culture, almost 
always asked a specific mother or father 


figure for help.” Different (B)’s produce 
characteristically different (E)’s. 

Finally, turn to the relationships which I 
have designated as those of psychological 
sociology, that is, (E)—(A)—>(B). As an 
example of an investigation of this relation- 
ship I would suggest the analysis made by 
E. Z. Vogt and Wilfred Bailey of water 
witching by the Texans in the Ramah area. 
These investigators found that water witch- 
ing is seldom practiced in that part of Texas 
from which these Texan settlers originally 
came. Therefore, the use of water witching 
by them in the Ramah area, where water is 
scarce and wells are costly, seems to indicate 
that such water witching has the function of 
a ritualistic means of alleviating individual 
anxiety. In other words, I am arguing that 
the anxiety, (E), of the individuals regarding 
water presents a special psychological proc- 
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ess, an (A), to the Texan community which 
results in ritualistic, magical beliefs (B) which 
exhibit themselves finally in the role per- 
formances (C) involved in water witching. 

One last remark. Probably every reader 
knows that the Regents of the University of 
California imposed a special non-disloyalty 
affirmation upon the California faculty and 
employees in June 1949, and that this is still 
being fought in the courts by eighteen of us. 
It is intriguing now to ask: if one were to 
make social science investigations of this set 
of social events, where in this schema would 
the pertinent researches fall? My answer 
would be in two places. 

First, there would be a type of research 
falling under the rubric of social psychology. 
That is, in order to explain the actions of us 
individual nonsigners one would have to 
make a study of how special kinds of sub- 
cultures, (B)’s, in our respective individual 
upbringings and backgrounds, have pro- 
duced for each of us a set of present needs, 
beliefs, values, etc., and resultant final field- 
forces, (E)’s, which caused all! of us to behave 
as nonsigners. ‘This social psychological 
study would, however, be no simple one. 
For, as nearly as I can make out, we eighteen 
seem to divide into at least three and pos- 
sibly four different subcategories in terms of 
the crucial determining sociological and cul- 
tural factors which lie behind us in our 
respective histories but which all result in our 
one common set of truculences, (E)’s, of not 
being willing to sign. 

Second, there would be a type of research 
possible (at least I hope it will turn out to be 
possible) to be done some years hence which 
would fall under the rubric of psychological 
sociology. Such a research would investigate 
the degree to which these peculiarities of value 
systems, cognitions, and personalities, (E)’s, 
in us, the nonsigners, and also in others 
throughout the academic world who have 
been giving us moral and financial support, 
will at that future date prove to have had 
some real effect upon the (A)’s and the 
resultant (B)’s of our academic institutions 
so that this sort of attack upon academic 
tenure and freedom will be less likely to 
reoccur. I, of course, hope that it will turn 
out that we really will have had some impor- 
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tant observable effects and that thus there 
will be provided material for a rich study in 
psychological sociology. But at this juncture 
I am keeping my fingers crossed. It may 
well be that the McCarthy psychology is 
going to win out in the end over our non- 
signing psychology in determining the future 
course of this country’s social institutions. If 
it does, we psychologists and sociologists 
must be prepared to close up shop right then 
and there just as our confréres had to do in 
Nazi Germany and have done in Com- 
munist Russia. 
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VALIDATION OF A SOCIOMETRIC REGROUPING PROCEDURE 
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Psychometric Affiliates 


N spite of the assumed importance of job 

satisfaction and interpersonal relations of 

workers a survey of the relevant litera- 
ture (1, 2, 3, 6, 7, 8, 9) shows a definite lack 
of systematic evaluation of these important 
determiners of employee attitudes and output. 
In a previous study (10) the author explored 
the relationships existing between interper- 
sonal desirability of workers (as determined 
by co-worker ratings), job satisfaction, and 
ten other more specific attributes of job sat- 
isfaction. Results of this study showed inter- 
personal desirability to bear a very definite 
relationship to job satisfaction (r of .82). 
Further analysis of the data also showed the 
interpersonally desirable worker 

1. to have a greater feeling of job security, 

2. to believe he has good working condi- 
tions, 

3. to consider his co-workers very friendly, 

4. to believe the company is interested in 
his welfare, 

5. to believe he possesses opportunity of 
communication with management, and 

6. to possess confidence in the good inten- 
tions of and good sense of management. 

In the belief that sociometric regrouping 
of the work teams would produce an in- 
crease in the amount of production and 
quality of work as well as a reduction in 
turnover the author—working in conjunction 
with the supervisor and foreman of the 
work groups, and a top member of manage- 
ment—set about validating the sociometric 
regrouping. 

SuBJECTS 

Subjects used in this study are identical 
with those used in the previous study (10). 
They comprised four total work groups—a 
carpenter group of twenty members, another 
carpenter group of eighteen members and 
two separate groups of bricklayers, each hav- 
ing sixteen members. All workers were well 
acquainted with each othcr’s personality and 
skill. None of the workers was in any way 
related to each other and few, if any, had 


known other members of his group previous 
to their present job. 


Tue CRIreria 

Effectiveness of the sociometric regrouping 
was ascertained by means of four criteria— 
job satisfaction, turnover rate, an index of 
labor cost, and an index of materials cost. 

Job satisfaction was measured by the Tear 
Ballot for Industry (4, 5) previous to the ex- 
perimental period and upon completion of 
the three-month experimental period. Turn- 
over records for the groups were taken from 
company kept, monthly compiled personnel 
records. Actual labor and materials costs 
were made available from cost account records 
kept on a “row-of-units” basis. Labor and 
material costs are unaffected here by size of 
work force. Because of the company’s desire 
for anonymity of identity and of cost figures 
no actual monetary results will be reported 
here. Instead the indices of labor cost and 
materials cost were arrived at by taking 
actual expenses in dollars and dividing 
through by a constant to arrive at the par- 
ticular index used. Fluctuations in output 
criteria are displayed graphically in Figures 
1 and 2. 


EXPERIMENTAL PROCEDURE 


The construction job upon which these groups 
were working was a large housing project on the 
outskirts of Chicago. The project was separated 
into two parts by a highway running through its 
center. All workers had previously been split up 
into two separate and comparatively remote groups 
—each working on a different side of the highway 
under different foremen and having little or no 
contact with the other group. 

Houses were constructed in rows of eight—be- 
ginning on the highway and moving away. Each 
row was identical with every other row, but houses 
within rows varied. Each group, then, performed 
identical jobs, thus controlling the job content 
factor. 

No disrupting incidences occurred during the 
three-month course of the experiment and simi- 
larity of working conditions was guaranteed by 
the propinquity of the two groups. 

Two groups were set up. The experimental 
group consisted of the carpenter group of twenty 
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no significant difference—critical ratio of —.78— 
while turnover records of the previous three-month 
period also indicated a similarity, the critical ratio 
being —.10. It can be assumed, therefore, that 
the two groups are adequately matched. 

All statistical comparisons were made between 
experimental and control groups during the same 
three-month period because of the effect of ad- 
verse weather conditions upon out-of-doors jobs 
and other contaminating influences due to the 
nature of the occupations studied (e.g., lack of 
other job opportunities for the worker during the 
winter months, etc.). 


ResuLts aNp Discussion 
As shown in Table 1 the experimental 
group varied significantly from the control 
group—the former being markedly superior 


TABLE 1 
Critica, Ratios CoMPUTED BETWEEN EXPERIMENTAL 
AnD Controt Groups FoR THREE-MoNTH 
Periop PRECEDING AND FOR THREE- 
MonTH Periop oF THE EXPERIMENT 
Labor and materials costs are based on a “row-of- 
units” basis (8 houses), 
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members and a bricklayer group with the other 
two groups serving as the control. Regrouping 
was on the basis of the previously collected co 
worker choices. In all cases the worker received 
his first (30 times) or his second (6 times) choice 
of work partner. Actual rearrangement was ac- 
complished when the supervisor in assigning the 
day’s work also announced work partners. This 
in no way violated previous procedures. 

However, before the rearrangement of work 
teams was put into effect both experimental and 
control groups were allowed to complete two rows 
of units in order that they might be tested for 
present homogeneity in work performance and to 
insure an absence of contaminating variables. 
Further comparisons between groups were also 
made on all criteria utilizing records for the pre- 
vious three-month period. Output criteria (labor 
and materials costs) when analyzed for this time 
span yielded critical ratios of —.32 and —.58 
respectively. A previous measurement of job satis- 
faction (prior to the experimental period) showed 





Job satisfaction 
Before experimental 
peri od 
During experimental 
period 
Turnover rate 
Before experimental 
period 
During experimental 
period 
Index of labor cost 
Before experimental 
period 
During experimental 
period 
Index of materials cost 
Before experimental 
period 
During experimental 
period 


—.58 


3.10 





on all four criteria. Job satisfaction scores 
show a mean increase of 5.5 points with a 
critical ratio of 3.52. Turnover in the socio- 
metric group is virtually nonexistent with 
a mean of .33 per month, the critical ratio 
between groups being 14.65. The marked 
decrease in turnover rate is of especially 
great significance since the building trades, 
at least in the Chicago area, are currently 
experiencing a critical skilled labor shortage 
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and are characterized by an unusually high 
turnover rate even for a “transient-employer” 
industry always noted for its high turnover. 
Labor costs and materials savings also show 
definite drops having critical ratios of 5.45 
and 3.10 respectively. 

Definite financial savings which cannot be 
presented here led the company’s chief con- 
struction engineer to state in his report to 
management, 


. savings due to this psychological procedure 
have exceeded those of any previous work savin 
device or any combination of five previous on 
saving methods. Financial benefits are such that 
we are now constructing every 29th building en- 
tirely free from labor and materials costs. Even 
greater financial gains would occur were it possible 
to evaluate monetarily savings due to the great 
reduction in turnover. 


It must be noted, however, that the building 
trades with their “buddy-work-teams” are 
especially suited for a sociometric regrouping. 
The main drawback to a universal adoption 
of sociometric procedures is the shifting of 
workers into different teams while avoiding 
any change in job duties which would have 
a negative effect on job performance, necessi- 
tate retraining the worker, etc., and so cancel 
any benefits accruing from the sociometric 
regrouping. Among these building trade 
workers such a limitation was not a hin- 
drance, for these workers, do not operate 
according to assembly line procedures and 
so must be more or less equally adept at all 
phases of construction. The necessity of 
working together in a rather mutually co- 
operating series of tasks may also be viewed 
as conducive to stress upon interworker 
relations. 

Further suppositions which must be met 
lie on management’s side of the ledger. It 
is necessary that management have a demo- 
cratic approach to the government of workers 


as well as recognizing the importance of 
group relations and manifesting an interest 
in worker preferences. Any adequate han- 
dling of such procedures as used here should 
be based on objective study and a recognition 
of social stimuli impinging upon the worker, 
his attitudes and expectations. 

Looking back upon the results of both this 
study and its predecessor it would seem that 
a careful employment of “human relations” 
procedures increases the worker’s sense of 
belongingness, and molds worker and man- 
agement together in a mutually satisfied 
group solidarity. Sociometric procedure as 
here used has resulted in increased satisfaction 
on the part of the worker and greater fi- 
nancial returns on the part of management. 
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PERCEPTUAL DEFENSE AND PROJECTIVE TESTS 
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ROJECTIVE approaches to personality study 

and laboratory work in the area of 

needs and perception have much in 
common. Both present to a subject for inter- 
pretation a stimulus which is ambiguous in 
some way. Although stimulus ambiguity is 
of cruciai importance in both of these situa- 
tions, relatively little critical attention has 
been given to its implications. The experi- 
ment which we are reporting here requires 
a clear understanding of what is meant by 
ambiguity. 

In essence, an ambiguous stimulus is one to 
which a number of interpretations may be 
reasonably given. When a stimulus is not 
ambiguous, there is no opportunity for indi- 
vidual differences in interpretation. The 
presentation of such a stimulus provides us 
with little or no information about the per- 
son interpreting or identifying it. Projec- 
tive testers have recognized that the more 
ambiguous a stimulus is, the greater the free- 
dom for personal interpretations. 

Few stimuli are completely ambiguous. 
Some information is provided by the stimulus 
which limits the range of interpretations that 
may be given. For example, in the exposure 
of words at decreasing tachistoscopic speeds, 
a point is reached when enough stimulus in- 
formation is communicated to the subject to 
narrow his guesses to words rather than ob- 
jects or people, even though it is not enough 
for correct identification. In the pre-recogni- 
tion guesses, therefore, we are not measuring 
solely what responses the subject (S) has 
available in terms of past experience, but the 
interaction of these available responses with 
the characteristics of the stimulus which have 
been successfully communicated. In their 
zeal to shift attention to the personality deter- 
miners of perceptual recognition, recent per- 
ception experiments have almost ignored the 
contributions of the stimulus to perception 
and projection. Even ink-blots have impor- 
tant stimulus characteristics. 

One of the implications of this last point is, 


that for any given stimulus, not all interpre- 


tations are appropriate. As a consequence, it 
is possible to say that the ambiguity of a 
stimulus depends upon the number of reason- 
able interpretations that can be given to it. 
We would suggest that what is reasonable 
can only be determined by observer agree- 
ment. Some normative approach is neces- 
sary. Thus under appropriate circumstances 
few of us would disagree that the interpre- 
tation of a can of evaporated milk as a baby’s 
diaper represents an unreasonable if not psy- 
chotic evaluation. 

At this point an interesting personality 
variable can be identified: It is the individ- 
ual’s capacity for appraising the limits of 
reasonable interpretations. The capacity to 
make these judgments is what clinicians have 
referred to as contact with reality or ego 
strength. 

An excellent example of this in clinical 
practice is seen in the use of Beck’s (1) F+-% 
on the Rorschach test to indicate a patient’s 
contact with reality. The S’s responses are 
examined to see whether they are more in 
keeping with those found among normal in- 
dividuals than among psychotic patients. The 
question is: Would most normal people agree 
that the blot does look like what S suggested? 
This agreement or lack of agreement is based 
upon norms for normal and psychotic people 
and is scored on the Rorschach record as 
good form (F-+-) or poor form (F—). 

We must recognize that distortion beyond 
reasonable limits can occur in either direc- 
tion; that is, an individual can give an inter- 
pretation that other observers would not 
agree was reasonable, or he can fail to make 
an interpretation that others would feel was 
demanded by the stimulus. 

In the former case the interpretation may 
reflect a strong adient need and may be 
readily accepted or even sought by S. In 
psychotics or persons with weak egos an in- 
terpretation reflecting an adient need may be 
given even if it is out of contact with the 
stimulus in terms of norms. In well-adjusted 
people such an adient need interpretation 
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will only occur when the stimulus is ambigu- 
ous enough to allow the interpretation to fall 
within its reasonable limits. 

In the latter case which involves actual 
avoidance of the threatening interpretation, 
contact may be so poor (in psychotics, for 
example) that there are virtually no limits 
to the extent of the denial or avoidance. In 
normal Ss the threatening interpretation may 
be avoided as long as reasonableness permits. 
At some point, however, as ambiguity de- 
creases, the normal person must come to 
grips with the threat. 

The concepts we have been discussing have 
in some ways been demonstrated in labora- 
tory experiments on need and perception (2, 
4, 5, 6, 8, 9). In some of these experiments 
the terms perceptual sensitization and defense 
have been applied to the two ways in which 
needs influence interpretations. The similar- 
ity between these laboratory experiments and 
the projective test situation has already been 
pointed out. Both require stimulus ambigu- 
ity. However, the experimental extension of 
the principle of perceptual defense to the 
Rorschach remains to be accomplished. 

It is a fundamental assumption in projec- 
tive psychology that the individual will inter- 


pret ambiguous material in terms of his own 


needs. Less explicit in projective theory is 
the fact that what an individual does not see 
may represent important clues concerning 
the sources of threat to the individual. Clin- 
ical psychologists have shown an awareness 
of this. Recognition of this notion may be 
found in the Freudian use of free association, 
dream interpretation, and in the writings, 
The Psychopathology of Everyday Life. The 
testing of the limits technique on the Ror- 
schach test is an example of this clinical 
awareness in action. In this procedure, the 
examiner notes what kinds of responses, 
which are normally expected in a record, are 
not given by the patient. In an effort to 
discover whether these omissions represent 
disinterest or real emotional avoidance, the 
clinician systematically points out to the sub- 
ject these percepts, asks him whether he can 
now see them, and requests him to indicate 
other examples of the same sort. 

We believe that it is of vital importance to 
show that when S does not give certain inter- 
pretations, aggressive ones, for example, this 
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failure may be the result of a defense against 
the anxiety produced by unacceptable ag- 
gressive drives. The experiment which we 
shall report presents direct confirmation of 
this hypothesis. When S denies the presence 
of certain percepts in the Rorschach ink-blots, 
it often indicates tension in the content area 
indicated. 

In the present experiment, we determined 
the degree of threat to each S of three need 
areas, and related this threat to his willing- 
ness or ability to see corresponding need- 
related percepts on the Rorschach test. The 
degree to which the need areas, aggression, 
succorance, and homosexuality were threaten- 
ing to S was evaluated by a modified word 
association technique. The effect of disturb- 
ance in these need areas upon Rorschach per- 
formance was determined by the use of a 
variation of the McReynolds’ Concept Choice 
Test (7). 


PROCEDURE 


The word association test consisted of 89 words. 
It began and ended with 10 neutral words and an 
additional 10 neutral words occurred in the middle 
of the list. Ten aggressive, 10 succorant, and 10 
homosexual words were scattered throughout the 
series, separated by additional neutral words. Stand- 
ard word association procedures were used in the 
administration of the test. Association time was 
measured to the nearest hundredth of a second and 
was recorded for each response word. The test is 
presented in Table 1. 

The concept choice test was made up of 10 aggres- 
sive, nine succorant, and nine homosexual concepts. 
In other words, Rorschach responses were selected 
which carried aggressive, succorant, or homosexual 
implications.! Each concept occurred twice, in dif- 
ferent places in the Rorschach test. In one loca- 
tion it was listed as F+ in Beck’s frequency tables. 
In the other location it appeared as an F—. There- 
fore, each of the concepts in the test was so selected 
that in one instance it was often given by normals, 
and in the other instances it was generally found 
only in psychiatric patients. 

The concepts were randomized for both order of 
presentation and for position in the Rorschach 
series. Administration of the test involved con- 
fronting each subject, individually, with the loca- 
tion of each concept and asking him, “Could this be 
a ——?” To insure that S would feel minimally 
threatened by the test situation itself, he was assured 


1 We would like to express our gratitude to the staff 
of the Veterans Administration Mental Hygiene Clinic 
of Baltimore, Maryland, and to the Veterans Adminis- 
tration for their generous help and cooperation on this 
experiment. Appreciation is also due Virgil R. Carlson, 
who collected most of the data and helped in the 
analysis. 
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TABLE 1 


THe Worp-Association Test * 


CLOCK 
SUPPORT 
FLOWER 
FAIRY 
BUILD 
KILL 
OCEAN 
BREAST 
STOVE 
BLOW 
LONG 
STAB 
LAMP 
ASSISTANCE 
SALT 
COCK 
LIGHT 
INTEREST 
STYLE 
INCREASE 
MORNING 
TOWN 
MARKET 


NEEDLE 
WORK 
CARPET 
HIGH 
SING 
WALK 
SOUR 
LOCAL 
LAUGH 
COLOR 
MURDER 
STEM 
DEPEND ON 
TRAVEL 
QUEER 
DRAW 
INJURE 
AFFORD 
NIPPLE 
SWIFT 
HOMO 
LAST 
HATE 


ZY®ZPZZZZZZZZZZ 
ZY°ZSZPZLZ*°ZrZxrZ’Z 


~ 
— 

~ 

~ 


XFZIZY%ZrZ 
ZZZZZZZ 


STORY 
PHONE 
STAND 
BUTCHER 
WINDOW 
HELP 
BOOK 
FRUIT 
WHISTLE 
DESTROY 
SOFT 
ALONE 
DOOR 
BUTTOCKS 
TALL 
FIGHT 
TABLE 
NURSE 
EASY 

ASS 
RIVER 
PUNISH 


FREQUENT 
BEHIND 
STATE 
CONTEMPT 
PREFER 
HELPLESS 
SUBURBS 
MALE 

FLY 
BEAUTIFUL 
SPEAK 
MOUNTAIN 
RUN 
BELIEVE 
ROUGH 
MOVE 
CITIZEN 
STOCK 


ZY°ZPHZZZZ 


~ 
— 


ZAZZZZZZZZAZZLIZAZYZHZIZYZ 


ZYZPZIZVZrZ 


>Zx 





* A=Aggressive word; S=Succorant word; H=Homosexual word; N=Neutral word. 


that there were no right or wrong answers. This 
test is shown in Table 2. 
The 35 Ss in the experiment consisted of a mix- 


ture of college students, college students in an out- 


patient clinic, and psychoneurotic patients at the 
Baltimore Veterans Administration Mental Hygiene 
Clinic.” 

The extent of disturbance for a need area on the 
word association test was measured in the follow- 


ing way. For each S, the median, the first, and 
the third quartile of the reaction times for the 30 
neutral words were determined. The neutral words 
used for this analysis were the ro at the beginning, 
the ro in the middle, and the 10 words at the end 
of the ent're list. The following weights of dis- 


2 The classification of the need words on the associ 
und the need concepts on the concept choice 
a short comment. The aggressive words 
obvious since they refer to 
concr behavior which in our society is labeled aggres- 
sive (viz. murder, injure, hate, kill, fight, etc. See 
I I The choice of succorant words and concepts 
is in keeping with the definition of the succorant need 
Murray. Ideas such as depend on, assist 
help refer to behavioral manifestations of 

Less obvious are concepts like breast and 
derived from psychoanalytic theory 
origin of dependency. In the case of the 
homosexual words and concepts, we made use of the 
common names of sexual organs and, in addition, some 
of the slang terms (such as fairy, queer and fruit) which 
clinical experience has shown are readily recognized as 
referring to homosexual practices. While the classifica- 
tion is somewhat arbitrary, it represents the only possible 
method of grouping the concepts in terms of content. 
Moreover, we believe that it is in complete agreement 
with the general consensus concerning the interpretation 
of Rorschach and word-association material. 


ation test 
test requires 


} 
and concepts are fairly 


as given by 
ance, and 
dependency. 
which 
about the 


nurse were 


turbance were applied to each of the need words, 
and the neutral word following it: 


Reaction time equals or exceeds the median + 
1Q=1 
Reaction time equals or exceeds the median + 
2Q=2 
The disturbance score for each of the need areas 
was obtained by summing the weights applied to 
TABLE 2 


RorscHacnH Concepts * 








Concept LocaTION 





Card I 
+H . Two fairies WwW 
—A . Sword D3 
—S 3. Cows udder Dd 31 
—H . Testicles Dd 22 
—S . Man being helped across a w v 
chasm 
+H . Two men (women) kissing D4 
+A . A woman with her head D4 
cut off 
Card X 
A child sucking on a breast D g and D6 
A man being helped across D6 V 
a chasm 
+A 10. Two animals fighting D8 
+S 11. A man suspended in air Dog 


+S 8. 
as 9. 





* A=Aggressive concept. 
S—Succorant concept. 
H=Homosexual concept. 
+ =Acceptable concept. 
—=Non-acceptable concept. 





+H 
—A 
—S 
—H 


—§ 


+A 
—S 
+S 
—S 


a 
+H 
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TABLE 2—Continued 








ConcEPT 


LocaTION 





12. A vagina 


I 


I 


3 


4 


- A woman with her head 
cut off 
. A penis 


15. A cannibal with big head 


and big lips 


16. A penis 


I 


I 


+ 
/ 


8 


. Two men fighting 


. A breast 


19. Boxing gloves 


2 


2 


NNR 
> Ww 


wv 


A 


wv 


- 


oO 


I 


we 


. Two men dancing together 

. A cannibal with big head 
and big lips 

A vagina 

. Two fairies 

. Human buttocks 


. A cow's udder 
A man lying in a drunken 
sleep 


7. Two men fighting 


. A snake 


29. Boxing gloves 


. Two persons 
quarreling 


arguing or 


. A man reclining 

. Two war clubs 

. A snake 

. A man suspended in air 


3 5. A bayonet 


3 


6. 


Testicles 

. Sword 

. A woman nursing a baby 
Bayonet 


. A dog sucking at mother’s 
breast 

. Two persons 
quarreling 


arguing or 


2. Two men (women) kissing 


A man bending over 
. Two animals fighting 
. A dog sucking at mother’s 
breast 


. Human buttocks 

. A war club 

. A breast 

. Two men dancing together 
A child sucking on a breast 


. A gun 

. A man reclining 

. A woman nursing a baby 

. A man lying in a drunken 
sleep 

. Machine guns 

. A man bending over 


Card Il 
Dd 24 
Ds 5 


D4 
Di V 


D2 
WwW 
Card Ill 


Card VII 
D2> 


WwW 


D6 
w V 
D1 
Dd 22 


Card VIII 
D7 

Dd 30 
D4 > 
D1 
DiandD4g 
Card IX 
Dd 26 
D3 V 
>t > 
w V 


Dd 21 
Dr > 





* A—Aggressive concept. 
S—Succorant concept. 
H=Homosexual concept. 
+—=Acceptadle concept. 

— =Non-acceptable concept. 
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all of the words in that need category and to all 
the neutral words which followed them. 

On the concept choice test the variables of impor- 
tance represented deviations from what the norms 
indicate are reasonable interpretations of the par- 
ticular locations in the ink blots. These deviations 
were termed rejections or acceptances. Rejections 
consisted of denials of F+ concepts; that is, if the 
subject answered, “No” to the question, “Could this 
be a ?”, and the concept was a Beck F+, he 
was credited with a rejection for the appropriate 
need area. Acceptances represented “yes” answers 
to F— concepts. An S's rejection score for any need 
area was computed by subtracting his acceptances 
from his rejections. 


REsuLTs 


Our primary concern in this experiment 
was whether or not the principle of percep- 
tual defense is operating in Rorschach per- 
formance. According to this notion, indi- 
viduals will avoid Rorschach interpretations 
which reflect ego threatening need areas. 
This indeed seems to be the case. In Table 
3 we have correlated the word association 
disturbance score for each need area with the 
rejection score for need-related Rorschach 
interpretations. 


TABLE 3 


CorRRELATION BETWEEN Worp AssociATION DisTURB- 
ANCE ScorE AND REJECTION SCORE ON THE 
RorscnacuH Concept Cuoice Test 





+ 





Propuct-MoMENT 


NEED-AREA “ 
CORRELATION 





Aggression 35 .40° 
Succorance 35 ~4qx® 
Homosexuality 35 -07 





* The significance of r is evaluated by Fisher's z 
transformation. For a one-tailed test, an r of .39 is 
significant at the .or level. 


Although the correlation for the homo- 
sexuality area is not statistically significant, 
the relationships obtained for succorance and 
aggression are significant beyond the .or level 


of confidence. For these two need areas, at 
least, the degree of ego threat (word associa- 
tion disturbance) is directly related to the 
number of need-related concepts rejected in 
the Rorschach situation. 

In Table 4 the intercorrelations of the three 
need areas have been determined within the 
word association test, within the concept 
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choice test, and between these two tests. A 
high disturbance or rejection score in one 
need area tends to be associated with high 
scores in the other two. There is also a 
tendency for the word association disturb- 
ance score for one need area to correlate with 
the rejection scores in the other need areas. 
These intercorrelations raise an important 
question concerning the basis of the relation- 
ships between disturbance and_ rejection 
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correlations. The results are similar to those 
shown in Table 3. The correlations are all 
reduced, but the relationships for succorance 
and aggression are still significant at the .o5 
level. We may reasonably conclude, there- 
fore, that in these two need areas the specific 
content of the stimulus is important. This 
finding also justifies, to some extent, the arbi- 
trary classification of content on the word 
association and concept choice tests. 


TABLE 4 


INTERCORRELATIONS 


Worp AssoOcIATION 


Agg. 


AMONG THE Neep AREAS ON THE Worp AssociATION AND Concept Cuolce Tests 


Concept CHOICE 


Suc. Homo. Agg. Suc. Homo. 





Agg. 
Suc. 
Homo. 


Word association 


Concept choice 


Agg. 
Suc. 


41 .26 -40 -13 
-53 -28 -41 
I +24 


-O1 





The N in all correlations is 35. 


For a one-tailed test, r values of .29 and .39 are significant at the .o5 and the .o1 levels. 


scores within any need area. We might ask 
the question, “Are these correlations the re- 
sult of the specific content of the need areas 
(that is, the aggressive, or succorant impli- 
cations of the words and concepts); or may 
they be explained as a general tendency for 
an individual who shows disturbance for any 
emotional material on the word association 
test to also reject more concepts on the Ror- 
schach?” In order to show that the specific 
need implications of the words in the asso- 
ciation test and the concepts in the Rorschach 
concept choice situation were important, it 
is necessary to hold constant S’s general tend- 
ency to give disturbance to emotional words 
and to reject suggested concepts. 

Therefore, a mean disturbance and rejec- 
tion score was computed for each S by aver- 
aging his scores for all of the need areas. 
His adjusted disturbance and rejection score 
for each need area was then computed as a 
deviation from the mean. These adjusted 
scores are now independent of the S’s tend- 
ency toward disturbance and rejection. By 
correlating these adjusted disturbance and re- 
jection scores for each need area we were 
able to determine the degree of relationship 
which results from the specific content of 
each of the need areas. Table 5 shows these 


TABLE 5 


CoRRELATION BETWEEN ApjusTeED Worp AssociATION 
DisrurBANCE Scores AND Apyustep REJECTION 
Scores ON THE Concept Cuotce Test * 


. Propuct-MoMENT 
Neep AREAS ~ 
CoRRELATION 
Ager ssion 
Succorance 
Homosexuality 





For a one-tailed test an r of .29 is significant at the 


05 level. 
* See text for a definition of adjusted score. 


The word association disturbance scores 
used in all the analyses thus far were based 
upon association time. We have also analyzed 
performance on this test in terms of S’s re- 
sponse words. For each S, we counted the 
number of response words with aggressive, 
succorant, and homosexual meanings, respec- 
tively. In line with a theory of perceptual 
defense, we would expect the number of 
need-related response words to be inversely 
related to the degree of ego threat. In other 
words, if a person is severely threatened by 
aggressive ideas, we would expect him to 
give few verbal associations bearing this 
stamp. Table 6 presents the correlations ob- 
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tained between the number of need-related 
response words and the disturbance and re- 
jection scores in the corresponding need areas. 
Although none of the correlations attain sta- 
tistical significance, all six of them are in the 
predicted negative direction. 


TABLE 6 


CORRELATION BETWEEN Number OF Neep RELATED 
Response Worps aNv DisTuRBANCE AND ReEJECc- 
TION SCORES FOR THE CORRESPONDING 
Neep AREA 








ConcePT 
CHOICE 
REJECTION 
Score 


Worp 
ASSOCIATION 
DIsTURBANCE 
Score 


NEED AREA 





Aggression 35 -18 
Succorance 35 -07 
Homosexuality 35 -04 


—.14 
—.13 


—.22 





For a one-tailed test an r of .29 is significant at the 
.05 level. 


Discussion 

The main question to which this experi- 
ment was addressed has been answered af- 
firmatively. The tendency to reject (or avoid 
seeing) Rorschach concepts is significantly 
related to emotional disturbance in the con- 
tent area with which that concept communi- 
cates. This does not mean that any time S 
fails to see something in the ink-blots he is 
emotionally threatened by it. Lack of experi- 
ence with, or the absence of interest in the 
concept is still apt to be important. It does, 
however, provide clear experimental evidence 
that such failures can often be related to ego 
threat. This is particularly relevant in the 
testing of the limits situation of the Ror- 
schach test where the subject is actually 
forced to deal with or reject certain kinds of 
percepts. We believe, however, that this 
principle is not limited to the Rorschach 
test. Eriksen (3) has shown that perceptual 
defense is an important factor in TAT stories. 
Failure to give certain interpretations in any 
projective test situation provides clues about 
areas of emotional disturbance. 

In surveying the data we have presented, 
the question arises why the relationships, 
which were strong in the aggressive and suc- 
corant areas, were so poor in the area of 
homosexuality. We would offer one expla- 
nation, after the fact. In looking over the 
Rorschach concepts which we chose to rep- 
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resent homosexual ideas, we have come to 
the conclusion that our choices were poor 
ones. Instead of tapping only .homosexual 
threats we have communicated with the sex- 
ual area in general. While the word “penis” 
is probably a source of anxiety for a homo- 
sexual, it is also likely to be anxiety producing 
for anyone with problems concerning sexual- 
ity. The area is therefore a heterogeneous 
one, and only minimally useful for identify- 
ing homosexual threat. On the other hand, 
the word association test is much more spe- 
cific, making use of such terms as “fairy.” 
We would guess that the lack of relationship 
obtained for the homosexual area does not 
necessarily reflect a true lack of correlation, 
but is rather an artifact of our experimental 
situation. 

A final comment is in order concerning 
the size of our relationships for aggression 
and succorance. While they are not high, 
they are indeed moderate. We were rather 
surprised when they turned up as* strongly 
as they did in view of the large errors of 
measurement we had to contend with in an 
experiment of this sort. Our measurement 
would have been more accurate if we had 
been able to obtain Rorschach norms for our 
own population sample, rather than have to 
depend upon the very limited norms avail- 
able in Beck. Moreover, since the number 
of concepts rejected by most of the Ss was 
not in itself large, there was a decidedly re- 
stricted range of variation which we had to 
work with. However, we were never really 
interested in determining the precise degree 
of relationship between emotional disturb- 
ance and rejections of Rorschach concepts. 
We believe that it is most important to have 
shown that there is such a correlation. It 
is quite possible that more careful measure- 
ment and the use of other supplementary 
procedures might have produced a set of cor- 
relations which would have practical useful- 
ness as a diagnostic test. 

At the moment, we are satisfied to bridge 
the gap between laboratory experiments with 
perceptual selectivity and projective pro- 
cedures in the clinic. 


SUMMARY 


In this paper we have attempted to relate 
some of the concepts which have developed 
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out of need and perception research to pro- 
jective testing. 

In the experiment which we have de- 
scribed, 35 psychoneurotic patients and col- 
lege students were administered a special 
word association test containing aggressive, 
succorant, and homosexual words. In addi- 
tion, a Rorschach concept choice test which 
resembled the testing the limits technique 
was given each subject. 

The data have shown that disturbance 
scores on the word association test for aggres- 
sion and succorance were significantly related 
to the tendency of subjects to reject cor- 
responding concepts on the Rorschach test. 
The experiment confirmed the familiar clin- 
ical belief that failure to make certain kinds 
of interpretations of projective material may 
reflect emotional disturbance toward that ma- 
terial. 


. Vanverpias, J. M., & Brake, R. R. 


REFERENCES 

. Beck, S. J. Rorschach’s test. New York: Grune & 
Stratton, 1944, Vol. 1. 

. Bruner, J., & Postman, L. Emotional selectivity in 
perception and reaction. /}. Pers., 1947, 16, 
69-77. 

. Ertxsen, C. W. Some implications for TAT inter- 
pretation arising from need and perception ex- 
periments. |. Pers., 1951, 19, 282-288. 

. Ertxsen, C. W. Perceptual defense as a function of 
unacceptable needs. |. abnorm. soc. Psychol., 
1951, 46, 557-564. 

. Lazarus, R. S., Ermxsen, C. W., & Fonpa, C. P. 
Personality dynamics and auditory perceptual 
recognition. ]. Pers., 1951, 19, 471-482. 

. McCLeianp, D. C., & Lrperman, A. M. The effect 
of need for achievement on recognition for need- 
related words. ]. Pers., 1949, 18, 236-251. 

. McReynotps, P. The Rorschach concept evaluation 
technique. Amer. Psychologist, 1949, 4, 267. 
( Abstract.) 


. Postman, L., Bruner, J., & McGinnies, E. Per- 


sonal values as selective factors in perception. 
]. abnorm. soc. Psychol., 1948, 43, 142-154. 
Selective 
sensitization in auditory perception. |. Pers., 
1949, 18, 252-256. 


Received by this Journar, June 2, 1951. 





THE EFFECTS OF PREVIOUS TRAINING ON POSTFRUSTRATION 
BEHAVIOR * 


JOEL R. DAVITZ 


Teachers College, Columbia Unwwersity 


has been central in both mental hygiene 

and social psychology. Psychologists and 
other social scientists have shown their inter- 
est in frustration and its effects by developing 
several theories, by performing a number of 
experiments and by using the concept of frus- 
tration to account for the deviant behavior of 
persons and of social groups. In spite of this 
extensive interest, many issues relating to the 
effects of frustration remain unresolved and 
the current theories remain incompletely 
tested. 

The research reported here deals with the 
influence of one variable upon reactions to 
frustration. The principal hypothesis is that a 
person’s response to frustration will be affected 
by his previous experience in situations similar 
to that in which frustration is encountered. 
Specifically, the experiment studies the dif- 
ferential effects of aggressive training and of 
constructive training on the responses to frus- 
tration made by children seven to nine years 
of age. 

In order to develop the theoretical rationale 
for the present study, three major theories of 
frustration presented in the recent literature 
will be critically reviewed. The frustration- 
aggression hypothesis, which assumes a high 
degree of correlation between frustration and 
an “instigation to aggression,” interprets post- 
frustration behavior in terms of direct ag- 
gression, displaced aggression, or substitute 
activity (3, 4). That direct or indirect aggres- 
sion is not a universal response to frustration 
has been widely discussed in the literature 
of frustration-aggression criticism. Thus, the 
category of “substitute activity” necessarily 
covers an extremely wide range of behavior. 
The interpretation of this wide range of non- 
aggressive behavior as substitute activity, 


I’ recent years the concept of frustration 


1The writer wishes to express his appreciation to 
Professors Irving Lorge, Laurance F. Shaffer, Edward J. 
Shoben, Jr., and Lincoln Moses of Teachers College, 
Columbia University, for their invaluable assistance in 
the present study. 
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“substitute” only from the perspective of a 
theoretician anticipating aggression, requires 
the manipulation of unverified inferred vari- 
ables, and does not provide an adequate gen- 
eral theoretical framework within which to 
analyze, predict, and understand this be- 
havior. 

The frustration-regression theory suggests 
that the change of behavior which occurs 
after frustration is predominantly in the di- 
rection of regression (1). While there is little 
question that regression may occur as a result 
of frustration, a change of behavior in this 
direction is certainly not the only possible 
change in direction. There are instances in 
the classroom every day in which the 
change of behavior after frustration is in the 
direction of growth rather than regression. 
There would seem to be no general factor 
in frustration per se which determines the 
direction of behavioral change, and the a 
priori assumption that the change is always 
in one direction is not consistent with every- 
day observations. Therefore, because it pre- 
dicts a change of behavior after frustration 
in only one direction, viz. regression, the 
frustration-regression theory does not provide 
a general theory of frustration. 

Maier has suggested that postfrustration 
behavior is nonmotivated behavior without 
a goal (6). This challenge of the postulate 
that all behavior is motivated rests on Maier’s 
experiments with animals forced to respond 
in insoluble problem situations. Maier ob- 
served that these animals developed consistent 
patterns of behavior, yet there was no appar- 
ent goal, and he concludes that postfrustra- 
tion behavior is qualitatively different from 
motivated behavior because it is behavior 
without a goal. Maier’s theoretical position 
restricts the definition of motivation to goal- 
seeking, neglecting the widely recognized 
definition of motivation in terms of ante- 
cedent conditions. The chief criticism of 
Maier’s position is that his basic experimental 
data, upon which he bases his fundamental 
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postulate of motivated and nonmotivated be- 
havior, can be adequately interpreted and 
predicted in terms of avoidance behavior, 
utilizing the concepts of learning theory and 
without invoking a new theoretical sphere of 
nonmotivated behavior. 

Having briefly examined these theories of 
frustration, it is suggested that a general 
theory of frustration cannot be restricted to 
the prediction of a particular mode of re- 
sponse or to a particular directional change of 
behavior after frustration. This critical ex- 
amination also suggests that frustration 
theory may be most fruitfully treated in 
terms of a more general theory of adjust- 
ment, rather than in terms specific to frus- 
tration alone. 

It is suggested that postfrustration behavior 
tends toward adjustment, and that the process 
of adjustment may be analyzed in terms of 
learning theory as suggested by Shaffer (10) 
and Miller and Dollard (7). For purposes of 
this study, frustration is defined as the bleck- 
ing of drive-evoked behavior. When this 
behavior is blocked and the drive continues, 
the cumulative intensification of the drive 
evokes an emotional response. The particu- 


lar pattern of behavior evidenced by the or- 
ganism is a function of the organism’s 
hierarchy of responses related to the emo- 
tional stimulation and the particular situation 


in which frustration is encountered. While 
it is suggested that this hierarchy of responses 
is a significant determinant of the organism’s 
postfrustration behavior, it is recognized that 
this is not the only determining factor. The 
intensity of the original frustrated drive and 
the resultant emotional response, the degree 
to which the original drive-evoking situation 
continues to impinge upon the organism, and 
the degree of active punishment involved in 
the frustrating circumstances may be several 
other factors involved in this complex process. 

Frustration theory is treated as merely one 
case of a general adjustment theory employ- 
ing the concepts of learning theory. No spe- 
cific behavioral responses are predicted as gen- 
eral results of frustration nor is the direction 
of behavioral change, either in terms of re- 
gression or growth, suggested as a general 
rule. As in all cases of behavior, the analysis 
of postfrustration behavior involves the inter- 
action of a particular organism and a par- 
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ticular situation, and the theoretical frame- 
work suggests general relationships among 
the various factors involved in this interac- 
tion. 

The principal focus of the present research 
was the development of differential response 
tendencies, prior to subjecting the subjects 
(Ss) to frustration. One group of Ss was 
trained aggressively before frustration, and 
another group was trained to act construc- 
tively. All Ss were trained in the same physi- 
cal setting as that in which the effects of the 
frustration were observed. It was assumed 
that the training received would develop in 
the individuals of each group a specific be- 
havioral tendency related to that physical 
situation, and it was hypothesized that their 
learned behavioral tendencies would dif- 
ferentially affect the behavior of each of the 
groups following frustration. 

The two major hypotheses of the study 
were: 

1. Subjects trained aggressively will behave 
more aggressively after frustration than will 
subjects trained constructively. 

2. Subjects trained constructively will be- 
have more constructively after frustration 
than will subjects trained aggressively. 


Procedure 


The experimental procedure may be divided into 
four major sections: (a) free play; (4) training; 
(c) frustration; and (d) free play. In the first ex- 
perimental session each of the ten groups was 
allowed free play with any of the materials in the 
experimental playroom. This was followed by a 
series of training sessions in which five groups were 
trained aggressively and five groups were trained 
constructively. The final sequence of the experi- 
ment consisted of a frustrating situation followed 
by a second period of free play. 

The experimental population consisted of 40 
subjects, 24 girls and 16 boys between the ages of 
seven and nine, selected from a group of children 
in residence at a summer camp. The mean age of 
the subjects was 100 months and the standard 
deviation of age was nine months. The total 
population was divided into five pairs of experimen- 
tal groups (10 groups), each pair matched on the 
basis of age and sex. There were four Ss in each 
group, and the particular type of training, aggressive 
or constructive, assigned to each group was deter- 
mined in a random manner. 

The playroom was twelve feet long and nine feet 
wide, and it was bounded on one end by a black 
wire screen which permitted cameras, placed at a 
small opening in a wall fourteen feet from the 
screen, to record the behavior during the pre- and 
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postfrustration play sessions. The play materials, 
which were arranged in an identical fashion before 
each free play session, consisted of clay, three dolls, 
building logs, dump truck, large plastic punching 
doll, hammer, saw, nails, and wood. Each group 
received seven thirty-minute training sessions. 

Aggressive training is defined as that which en- 
courages and rewards behavior the goal of which 
is injury to some object or person. The aggressive 
training was a series of games designated as: Cover 
the Spot, Scalp, and Break the Ball. These games 
are briefly outlined below. 

Cover the Spot: At the center of a mat placed 
on the floor was a small x marked in black chalk. 
Each S was instructed to cover the spot with some 
part of his body, and that person covering the spot 
at the end of the game was the winner. Only one 
person could cover the spot at one time, and it 
was emphasized to the Ss that there were no rules 
limiting their aggressive behavior during the game. 

Scalp: A piece of cloth was tied around the arm 
of each subject and he was informed that this was 
his scalp. The object of the game was to tear the 
scalps from the other S’s arm while protecting one’s 
own scalp. 

Break the Ball: Each S was provded with a 
ping pong ball which was placed on the floor and 
could not be touched by hand. The object of the 
game was to break everyone else’s ping pong ball 
while protecting one’s own ball. 

The several games described above were played 
for a period of ten minutes and repeated during 
the seven training sessions. During these training 
sessions a chart for each group was kept on the wall, 
and the winner of each game was awarded a star 
on this chart. Throughout these sessions aggressive 
behavior was praised and encouraged by the experi- 
menter (E), and, in general, there was a high de- 
gree of aggressive behavior evidenced. 

Constructive training is defined as that which 
encourages and rewards behavior involving the use 
of materials for the construction of designated 
objects. The constructive training consisted of 
drawing murals and completing jigsaw puzzles. 


TABLE | 


EXPERIMENTAL PROCEDURE 








SecTION SESSION PROCEDURE TIME 





Prefrustration free play 18” 
Training* 30° 
Training 30° 
Training 30° 
Training 30° 
Training 30" 
Training 30" 
Training 30° 
Frustration 15° 
Postfrustration free play 18’ 


AVaWN 


P ey 


Il. 
IV. 


~ 





* During the training sessions (2-8), five groups of Ss 
were trained aggressively and five groups of Ss were 
trained constructively. 
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During four sessions a long sheet of paper was 
placed on the wall and a box of crayons was placed 
on the floor. Instructions were given to draw a 
single picture on the entire sheet of paper, and E 
emphasized the constructiveness of each S as well as 
the cooperation of the group. During the remain- 
ing sessions, each group was presented with a jig- 
saw puzzle containing thirty pieces. The Ss were 
told that if the pieces were put together correctly, 
they would form a picture of American Indians 
or a familiar fictional character. When the group 
completed the first puzzle, a second puzzle was 
provided, and this was continued until the end of 
each training period. Throughout the training 
periods all aggressive behavior was discouraged by 
E, while constructiveness, on the other hand, was 
praised and encouraged. 

The final phase of the experiment consisted of 
the frustration and a second free play session. The 
Ss were seated on the floor next to a projector out- 
side of the playroom and told that they were to see 
movies. Five reels of film, arranged next,to the 
projector, were contained in boxes which displayed 
the titles of the film and a picture of the leading 
character. ‘The first reel was shown completely. 
At the start of the second reel, each S was given 
a bar of candy, and at the climactic point of the 
film, E stepped in front of the seated Ss, removed 
the candy from their hands, and ushered them into 
the playroom. As E locked the screen door of 
the playroom, he made the following statement, 
“You cannot have any more candy or see any 
more films, but you can play with anything in the 
room.” Although the Ss could see the projector, 
which continued to run, through the screen door, 
the movie screen was not visible. E did not answer 
any questions and made no comment on behavior. 
In no case was there any contact between groups 
during the final phase of the experiment. 

The behavior evidenced in the pre- and post- 
frustration free play sessions was recorded on mov- 
ing picture film for a period of eighteen minutes, 
starting in each case with the moment the Ss 
entered the playroom. Two eight-mm. cameras 
with automatic self-winding devices and wide angle 
lenses were used for this purpose. In order to 
analyze the data, the pre- and postfrustration be- 
havior recorded on film was observed and written 
protocols of the behavior of each S were made. 
Pre- and postfrustration periods were not identified 
and the observations were made independently by 
two observers. During each viewing of the film 
the observers made a continuous record of a single 
S. These protocols were compared, differences be- 
tween observers noted, and the procedure was re- 
peated until agreement between observers was 
reached concerning the behavior of each S. 


RESULTs 


The data were analyzed in terms of the 
two major hypotheses, and the statistical 
analysis pertaining to each hypothesis will 
be presented separately. 
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Aggressiveness 


It is hypothesized that Ss trained aggres- 
sively will evidence more aggressive behavior 
after frustration than will Ss trained con- 
structively. The eighty protocols of be- 
havior, including the records of pre- and 
postfrustration behavior of all forty Ss, were 
presented to four judges, who were asked to 
rank the protocols in order of aggressiveness. 
The playroom and materials were described 
however, the pre- and post- 
frustration and the individual Ss 
were not identified. The eighty protocols 
were presented in random order, and the 
protocols were ranked independently by each 
judge. The judges were doctoral students in 
psychology who had previous experience in 
ranking procedures. 

The agreement among judges was deter- 
mined by the coefficient of concordance as 
presented by Kendall (5, p. 61). All four 
judges agreed that twenty protocols evidenced 
no aggression and could not be ranked along 
the aggression continuum. Including these 
protocols in the evaluation of agreement 
among judges would result in a spuriously 


to the judge Ss; 


sessions 


high coefficient of concordance, since the con- 
cordance of judges on protocols not on the 
continuum is +1. Therefore, in computing 
the coefficient of concordance these protocols 


were omitted. The value of the coefficient 
of concordance, corrected for ties, for the 
four rankings of sixty protocols was found 
to be .g03,” indicating an extremely high de- 
gree of agreement among judges. 

In the following analysis of the data, the 
pre- and postfrustration ranks of each S were 
determined by summing the ranks assigned 
to each protocol by the four judges and ar- 
ranging these summed ranks in order from 
the lowest to the highest sum. It should be 

2 Since there were a number of ties within the rank 


order of each rater, the computation of W must correct 


for these tics. 
AY } 


\ | | eee 


’ 
on 

1 (m) [ (m?—n)—m?, | 
— uL 


12 
is the coefficient of concordance; m 
n is the number of ranked protocols; 


deviations from the 


where W is the 
number of raters; 
S is the sum of the squares of 
mean rank; 

J _r ZF (t8—t) 

{r= , 

12 t 

where ¢ is the number of ties in the rank order of 
each rater. 
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noted that the form of the population of 
ranks cannot be specified; therefore, the 
analysis of the data is in terms of non-para- 
metric inference. 

In order to test the major hypothesis con- 
cerning aggressiveness, the pre- and postfrus- 
tration ranks of each S were compared, and a 
gain of rank was indicated by a +; a loss 
in rank, by a —. The null hypothesis which 
was tested may be stated as follows: The 
probability of gains after frustration of the 
aggressively trained group is less than, or 
equal to, the probability of gains of the con- 
structively trained group. ‘The null hypoth- 
esis would be rejected for a high number of 
gains in the aggressive group. 

Only pairs of ranks (the pre- and post- 
frustration ranks of each individual) were 
considered. Therefore, these observations may 
be treated as independent, and the significance 
of the difference of gains and losses between 
the two groups may be tested by the ordinary 
method of chi-square. In computing chi- 
square, the ties for each group were split, half 
added to the number of gains and half to the 
number of losses in each group. This pro- 
cedure increased the numerator of chi-square, 
thus providing the most conservative estimate 
of probability. The obtained chi-square cor- 
rected for continuity is 3.63. However, the 
hypothesis under consideration is a one-sided 
hypothesis and the chi-square value is in 
terms of a two-sided hypothesis. For mod- 
erate samples, if chi-square has one degree of 
freedom, the square root of chi-square has 
a distribution which is the right hand half 
of a normal distribution. Therefore, in order 
to test the present one-sided hypothesis, the 
chi-square value must be converted into the 
equivalent value in terms of a unit normal 
deviate. This value is 1.90; therefore, the null 
hypothesis is rejected at the .05 level of sig- 
nificance. It is concluded that aggressively 
trained Ss behaved more aggressively after 
frustration than constructively trained Ss. 

The equivalence of the location of pre- 
frustration ranks of the two groups is evalu- 
ated by the median test as presented by Mood 
(8, p. 394). The hypothesis that the two 
population medians are equal is tested. The 
resulting value of chi-square is 1.6 and 
the hypothesis cannot be rejected even at the 
.10 level of significance. Therefore, we may 
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conclude that in terms of the location in the 
rank order of aggressiveness before frustra- 
tion, the aggressive and constructive groups 
were equivalent. 


Constructiveness 


It is hypothesized that the constructively 
trained Ss will behave more constructively 
after frustration than the aggressively trained 
subjects. The analysis of the data in terms 
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The pre- and postfrustration protocols were 
arranged in rank order of constructiveness by 
taking the sum of ranks assigned to each 
protocol by the five judges. Correcting for 
continuity, the resulting value of chi-square 
was 5.10. This value was converted to the 
corresponding value in terms of a unit nor- 
mal deviate and the value 2.25 was obtained. 
Therefore, the null hypothesis, that the prob- 
ability of gains of the constructively trained 


TABLE 2 


RANK ORDER OF AGGRESSIVENESS, PRE- AND PosTFRUSTRATION 








CONSTRUCTIVE 
Subject ————— Dir FERENCE® 
PRE Post 


AGGRESSIVE 


Susyecr DiFFERENCE® 





40 48 
38 70 
59 7° 
70 70 
8 7 
31 35 
9 5- 
34 47 
19 70 
20. 39 
23. 70 
22 51 
26 15. 
20. 43 
29. 13 
32 25 
70 70 
70 70 
7° 29-5 
4! 70 


On AUS Wh 
I+l+ol 


1 | 


| 


+ 
oa 
oa 
° 
°o 
+ 


++t++4++4+1 oo L+tt4+1 1444 





Gains: 6 
Losses: 11 
Ties: 3 


Gains: 14 
Losses: 
Ties: 1 





® + indicates gain in rank; — indicates loss in rank; 0 indicates no change in rank. 


of this hypothesis is essentially the same as 
the procedure discussed above in terms of 
aggressiveness. The eighty protocols were 
ranked from most to least constructive by five 
judges, and the agreement among judges was 
determined by the coefficient of concordance. 
The value of the coefficient of concordance 
was found to be .904,’ indicating an extremely 
high degree of agreement among judges. 


8 The coefficient of concordance, W, is obtained by 


. 12S 
w=—-—., 
m?(n3—n) 
the coefficient of concordance; m is the number of raters; 
n is the number of protocols ranked; S is the sum of 
the squares of deviations from the mean rank. 


the following formula where W is 


group is less than, or equal to, the probabil- 
ity of gains of the aggressively trained group, 
was rejected at the .o2 level of significance. 
It may be concluded that the constructively 
trained Ss behaved more constructively after 
frustration than the aggressively trained Ss. 

The equivalence of the location of pre- 
frustration ranks of the two groups is eval- 
uated by the median test. The hypothesis 
that the population medians are equal is 
tested. The resulting value of chi-square is 
0.4, and the hypothesis cannot be rejected 
even at the .10 level of significance. There- 
fore, it may be concluded that in terms of 
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location in the rank order of constructiveness 


before frustration, the constructive and ag- 
gressive groups were equivalent. 


CoNcLUusIONS 
Both of the experimental hypotheses were 
supported by the experimental results. There- 
fore, it may be concluded that under the con- 
ditions specified in the present experiment 
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The evidence countering the frustration-ag- 
gression hypothesis is even more striking 
when the constructively trained group is con- 
sidered; 11 of the 20 Ss in this group behaved 
less aggressively after frustration, while only 
six behaved more aggressively. Furthermore, 
it does not seem reasonable to interpret the gen- 
eral decrease of aggressiveness evidenced by 
the constructive group and the increase of 
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RANK Orpver oF CoNsTRUCTIVENESS, PRE- AND PosTFRUSTRATION 








CONSTRUCTIVELY TRAINED 


AGGRESSIVELY TRAINED 





DIRECTION OF 
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Gains: 1 


Losses: 8 


Gains: 
Losses: 





* + indicates gain in rank; — indicates loss in rank. 


previous training in situations similar to that 
in which frustration is encountered is a sig- 
nificant determinant of the organism’s post- 
frustration behavior. These results are in 
contrast with past studies of frustration which 
have interpreted postfrustration behavior 
primarily in terms of the frustrating situation 
itself. 

The experimental results do not seem to 
be consistent with the frustration-aggression 
hypothesis as a general theory of frustra- 
tion. While fourteen of the Ss of the ag- 
gressively trained group behaved more ag- 
gressively after frustration, the postfrustration 
behavior of five subjects was ranked as less 
aggressive than their prefrustration behavior. 


constructiveness evidenced by 12 members of 
this group as “substitute activity.” It would 
seem to be more consistent with objective 
psychological theory to interpret these results 
in terms of a general theory of adjustment 
rather than in terms of a specific mode of 
response such as aggression. 

The frustration-regression theory suggested 
by Barker, Dembo and Lewin (1), and 
Maier’s (6) interpretation of postfrustration 
behavior as behavior without a goal, are not 
supported by the data. While growth and 
constructiveness as defined in this study are 
not synonymous, there is a close relationship 
between these two concepts. Therefore, to 
interpret the increase of constructiveness as 
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evidenced by 16 of the 40 subjects in this 
experiment in terms of regression or behavior 
without a goal does not seem to be a valid 
theoretical procedure. While 22 Ss did evi- 
dence less constructiveness after frustration, it 
is obvious that the change of behavior is not 
necessarily in the direction of regression. 
Previous training is at least one factor which 
determines this change of direction. 

It should be noted that in the experiment 
presented here the two major hypotheses are 
interdependent. -The data used for testing 
the two hypotheses were obtained from the 
same population and the behavior of each 
S was treated in terms of aggressiveness and 
constructiveness. All other things being 
equal, a high degree of constructiveness was 
associated with a low degree of aggressive- 
ness, and vice versa, as indicated by a rank 
order correlation of —.83 between aggressive 
and constructive ranks of the 40 subjects. 

The effects of prefrustration training in 
this experiment were not invariant. Six in- 
dividuals within the constructively trained 
group behaved more aggressively after frus- 
tration and four Ss in the aggressively trained 
group behaved more constructively after frus- 
tration. This indicates that while the ex- 
perimental training was a significant factor 
in terms of the behavior of the group, the 
total past history of the individual must be 
considered in predicting and understanding 
his behavior after frustration. 

It has been demonstrated that postfrustra- 


tion behavior cannot be treated only in terms | 


of the stimulus conditions associated with the 
frustration. The external stimulus conditions 
of the frustrating situation were identical for 
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the aggressive and constructive groups, while 
the previous training of the groups differed. 
Therefore, in this case the previous training 
was the significant factor which determined 
the differences of the change of behavior in 
the two groups after frustration. It is sug- 
gested that previous experience in situations 
similar to that in which frustration occurs is 
one factor which must be considered in the 
understanding of postfrustration behavior. 

The results of this single experiment are 
presented neither as conclusive evidence of 
the inadequacy of present frustration theories 
nor as final evidence of the effects of previous 
experience on postfrustration behavior. For 
the writer, the most significant result is a 
realization of the need for further experimen- 
tal study in this area. 
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RELIGIOUS ETHNOCENTRISM AND ITS RECOGNITION 
AMONG ADOLESCENT BOYS * 


R. E. GOODNOW anv R. TAGIURI 
Harvard University 


on the existence of religious ethno- 

centrism but little on its recognition. 
Do the groups excluded by others’ ethno- 
centrism underestimate or show accurate 
perception of their exclusion? Do the ethno- 
centric groups accurately recognize their 
own self-preference and the extent to which 
they exclude others? These questions are 
basic to understanding ethnocentrism. 

The present study employs a technique 
that obtains measures of recognition and of 
ethnocentrism simultaneously and from the 
same subjects. The subjects are Protestant, 
Roman Catholic, and Jewish students in a 
boys’ preparatory school offering the four 
high school years. Most of the boys come 
froma homes in the upper-middle and upper 
socioeconomic levels, although an appreciable 
number of students are on part or full 
scholarships. A statement of religious affili- 
ation is not requested on the application, and 
there is no “quota” system. Within the 
school the “visibility” of the religious minor- 
ity groups is reduced to a minimum. There 
are no special religious services for Catholics 
and Jews, no special dietary provisions, no 
observance of Jewish holidays, no segregation 
in living arrangements. The school attempts 
to satisfy as much as possible the student's 
choice of roommate. The living arrange- 
ments and school activities make it possible 
for all students to have almost the same 
opportunities for contact with the other 
students. 


A GREAT deal of research has been done 


METHOD 


The data were obtained during one of the daily 
assemblies. The Dean of Students announced that 
he was interested in finding out to what extent the 
administration could satisfy the roommate prefer- 
ences of the students. Each student was given a 
3 X § inch card. The Dean then instructed the 
students as follows: 

1TLis paper is a by-product of a larger study of 
social interaction of adolescents sponsored by the Grant 
Foundation, Inc. 


“On the left half of the card, write the names 
of the students you would like to have as room- 
mate ext year. Use as many or as few names as 
you like.” 

He then allowed four minutes to elapse, after which 
he added: 

“On the right half of the card, now write the 
names of those students whom you guess would like 
to have you as a roommate. Use as many or as 
few names as you like.” 

After another four minutes, he added that those 
who would prefer rooming alone should put a cross 
at the lower left of the card. 

The first part of the task, choosing roommate, is 
a standard sociometric procedure. The second part, 
the guessing, is an addition to the standard method. 
It permits an investigation of the student's recog- 
nition of his social situation. An important feature 
of the procedure is that there was not the slightest 
suggestion that the information had anything to do 
with religious differences. The data were, in fact, 
obtained for another purpose, and the possibility of 
making a special study of ethnocentric choices was 
an afterthought. Another important feature is that 
the activity fitted naturally into the school situation 
and was of considerable importance to every sub- 
ject. 


REsuLts AND Discussion 2 


Three of the measures obtained from the 
procedure will be considered here. 

1. Choices made by a student, which indi- 
cate the roommates the student would like 
to have. 

2. Choices received by a student, which 
refer to the other students who would like 
to room with him. 

3. Guesses, which refer to the choices a 
student believes he is going to receive. 

The first step in the analysis demonstrates 
that ethnocentrism exists. If the members 
of the three religious groups did not show 
ethnocentrism in their choices, we would 
expect a correspondence betwen the distri- 
bution of choices made among the groups 
and the distribution of the students according 
to religious affiliation. In other words, each 

2 Details on this procedure and its analysis will be 
found in R. Tagiuri’s Relational analysis: An extension 


of sociometric method with emphasis upon social percep- 
tion. Sociometry, 14, Nos. 1-2, 1952. 
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TABLE 1 
DisTRIBUTION OF CHOICES 








PROTESTANTS 


R.C.’s Jews Tora 





Group 
N Per cent * 


N Per cent * N Per cent * Per cent * 





Choices made by Protestants of: 83. 
Choices received by Protestants from: 82. 
Choices made by R.C."s of: 77- 
Choices received by R.C.’s from: 74- 
Choices made by Jews of: 59- 
Choices received by Jews from: 94 55- 


245 11.7 ; 100.0 
250 11.7 _ 100.0 
62 19.3 c , 100.0 
62 18.7 ’ 100.0 
24 10.9 " 100.0 
9 5-4 - ° 100.0 








Number in each group 496 78. 


79 12.7 . 100.0 





* All percentages represent per cent of total entered in the extreme right column. 


choice distribution should approximate the 
population distribution which is 78.0 per cent 
Protestant, 12.7 per cent Roman Catholic, and 
9-3 per cent Jewish. If, however, the distri- 
bution of the choices of a group is biased to 
the extent of differing significantly from these 
values, we would conclude that ethnocentrism 


is present. Such bias may take the form of 
preference for members of one’s own group, 
exclusion of members of another group, or 
both. 

The choices made by the groups are com- 
pared with the representation of the groups 
in the population in Table 3. 


TABLE 2 
DisTRIBUTION OF GUESSES 











PROTESTANTS 


R.C.'s Jews Tora 





N Per cent ® 


N Per cent * Percent® WN Per cent * 





Guesses made by Protestants of: 83.8 
Guesses made by R.C.’s of: 75.2 
Guesses made by Jews of: 69 51.9 


146 11.7 4-5 1245 100.0 
41 21.6 3.2 190 100.0 
II 8.3 39.8 133 100.0 








Number in each group 496 78.0 


79 12.7 9.3 634 100.0 





* All percentages represent per cent of total entered in the extreme right column. 


TABLE 3 


Groups’ CuHoices CoMPARED wiTH Groups’ REPRESENTATION IN THE EXPERIMENTAL PoPULATION 








' 

| Per Cent 
Group OF 

CHOICES 


Per Cent 
OF DIFFERENCE 
PopuULATION 








Protestant choice of: 
Protestants 83.8 
Catholics 11.7 
Jews 4-5 

° 


100.0 (N=2091) 


78.0 +5.8* 
12.7 —1.0 
9-3 —4.8* 
100.0 (N=634) 








R.C.'s choice of: 
Protestants 77. 
Catholics 19. 
Jews 2. 


Totals 100.0 (N=321) 


78.0 
12.7 
9-3 
100.0 (N=634) 





Jewish choice of: 
Protestants 59-7 
Catholics 10.9 
Jews 29.4 
Totals 100.0 (N=221) 





78.0 
12.7 
9-3 
100.0 (N=634) 





in the popul 
in the popul 





+ indicates choosing in excess of repr 
— indicates choosing less than rep 
® significant at 19% level. 








2 naar CL Heat 
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It is clear from the table that bias is present. 
In detail: 

1. Each group chooses a larger percentage 
of its own members than its percentage of 
representation in the population. The differ- 
ences are significant at the .o1 level.* 


R. E. Goopnow anp R. Tacruri 


2. In choosing members of the other 
groups, (a) Protestants choose significantly 
fewer Jews but not significantly fewer Catho- 
lics than the expected population proportions. 
(4) Catholics choose significantly fewer Jews 
but not significantly fewer Protestants than 


TABLE 4 


Group 








Cuoices RECEIVED DIFFERENCE 





Protestants in relation to: 
Protestants 
R.C.'s 
Jews 


Total 





Catholics in relation to: 
Protestants 
R.C..'s 
Jews 


Total 





Jews in relation to: 


Protestants 
R.C.’s 
Jews 





-++ indicates more choices made than received. 
— indicates fewer choices made than received. 
®* significant at 1% level. 


8 All tests of statistical significance were performed with 
the binomial probability paper developed by Mosteller 
and Tukey (1). This technique permitted throughout 
uniform and quick estimates of statistical significance. 
The writers are cognizant of the fact that the empirical 
data do not meet all the assumptions underlying this 
method of significance testing. One may wonder whether 
the results would still hold with stricter and perhaps 
more appropriate statistics. The writers have explored 
such possibility in the case of Roman Catholics’ choices 
of Catholics (cf. Table 3). Here the absolute difference 
between the expected and observed percentages is not 
large (6.6 per cent), and the size of the population in- 
volved is relatively small (N = 79). 

The procedure used involved the computation of the 
proportion (pis) of choices directed by each subject (7) 
7H 

, where zx is 
i 
the number of choices directed to Roman Catholics, and 
i is the number of choices in the fth stratum. We then 
combined the pij’s into an over-all mean proportion 


kN | 
Ss pit 


in a choice stratum (7), so that psy = 


Nip 


on(En)} 


obtained this way was .196 
(vs. .193 of Table 3). The value of the critical ratio 
was 7.55 (probability .oo1). This confirms and 
strengthens the findings obtained by our procedure. 





The observed proportion 


expected. (¢) Jews choose significantly fewer 
Protestants but not significantly fewer Catho- 
lics than expected. 

Both Protestants and Catholics are ethno- 
centric with respect to the Jews while the 
latter are ethnocentric with respect to the 
Protestants. There is no significant bias, 
however, against the Catholic minority. Yet 
the Catholic and Jewish minorities are com- 
parable proportions of the population (12.7 
and 9.3 per cent). 

Although all groups prefer their own mem- 
bers and to some extent withhold choices 
from others, the question arises: Is one group 
excluded more than another? The question 
is answered in Table 4, which shows that 
(a) Protestants choose significantly fewer 
Jews but not significantly fewer Catholics 
than those groups, in turn, choose Protes- 
tants; (4) Catholics choose significantly 
fewer Jews but not significantly fewer Protes- 
tants than these groups, in turn, choose 
Catholics; and (c) Jews choose significantly 
more Protestants and significantly more 
Catholics than these groups, in turn, choose 
Jews. 





Reuicious ErHNocentrismM AMONG ADOLESCENT Boys 


Both Protestants and Catholics receive 
more choices than they make, while Jews 
receive fewer choices than they make. It is 
thus apparent that the Jewish group is the 
least chosen, or the most excluded.* 

Is this bias recognized? This is the prin- 
cipal question. The measure of recognition 
is the correspondence of the guesses made by 
a group with the choices received by the 
group (Table 5). 

Table 5 shows that the distribution of 
guesses made by each group is more nearly 
like the distribution of choices the group re- 
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lics fail to recognize the extent to which the 
Jewish students choose them. There is no 
significant difference, however, between the 
per cent of guesses made by the Jewish group 
and the per cent of choices received by them. 
Thus, the Jewish group guesses at least as 
well as the other two groups, despite the 
great discrepancy between the choices they 
receive and the population distribution. 
There is further evidence of the Jewish 
students’ awareness of the situation. Since 
Jewish students receive fewer choices than 
they make from Protestants and Catholics, 


TABLE 5 


Group Guesses CoMPARED WITH CHorces RECEIVED 








DIFFERENCE PER CENT 
BETWEEN or 
PERCENTAGES PoPpULATION 


PER CENT 
CHoIcEs 
RECEIVED 


PER CENT 
GUESSES 
Mave 


Group 





Protestants in relation to: 
Protestants 83.8 
R.C.’s 11.7 
Jews 4-5 

o 


Total 100.0 (N=1245) 


+1.7 
o 


Shed 
(N=2134) 





Catholics in relation to: 
Protestants 75. 
R.C.'s 21. 
Jews 3. 

Total 100. 





Jews in relation to: 


Protestants 51. 
R.C.'s 8. 
Jews 39- 

Total 100. 





(N=133) 





+ indicates more guesses than choices received. _ 
— indicates fewer guesses made than choices received. 
* 5% level of significance. 


ceives than the distribution of the population. 
In general, it seems that each group guesses 
well the response it receives. There are, 
however, two significant discrepancies be- 
tween guesses made and choices received. 
Both the Protestants and the Roman Catho- 


4 Professor G. W. Allport has pointed out to the 
writers that a group may adjust to exclusion by (a) 
choosing proportionately more of its own members, (4) 
choosing from the excluding group anyway, (c) with- 
drawing. Our data show that the Jewish group employs 
all three alternatives. They show in-group preference 
(Table 3), choose from the other groups even though 
the choices are not reciprocated (Table 4), and they 
make fewer choices than the other groups (the mean 
number of choices made by Jews is 3.7, by Catholics, 4.1, 
by Protestants, 4.2). Withdrawal is also shown by the 
per cent of students who want to room alone: 34 per 
cent of the Jewish group, 27.8 per cent of the Catholic, 
and 19.4 per cent of the Protestants indicate preference 
for rooming alone. 


their recognition of this discrepancy should 
appear in their directing proportionately 
fewer guesses than choices to Protestants and 


Roman Catholics. This is indeed the case. 
Protestants and Roman Catholics together 
receive 70.6 per cent of the Jewish choices but 
only 60.2 per cent of their guesses. The dif- 
ference is significant at the .05 level. 

In conclusion, even where the distribution 
of choices a group receives is quite different 
from the population distribution, the group 
accurately guesses the choices received. Two 
significant exceptions are the Protestant and 
Catholic guess of Jewish response to them. 
Both these groups, who are particularly 
ethnocentric in their choices of Jews, fail to 
recognize the extent to which the Jews choose 
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them. The Jewish students, who are the 
object of the Protestant and Catholic ethno- 
centrism, recognize the extent to which they 
are excluded. 


SUMMARY 


Students’ recognition of ethnocentric 
(biased) choice of roommate was investi- 
gated among Protestant, Roman Catholic 
and Jewish boys in a preparatory school. 

The data were obtained by means of an 
extension of a sociometric technique. The 
subjects were requested to write on a card the 
names of the students whom each would 
choose as a roommate, and then the names 
of those each guessed would choose him. 

Percentages of choices made by Protestants, 
Catholics, and Jews from their own groups 
were compared with the percentages of 
students in each group. It was found that 
each group made a greater proportion of 
choices of its own members and a smaller 
proportion from other groups than the pro- 
portional group sizes. That is, each of the 
three groups was found to be ethnocentric. 
Furthermore, although each group excluded 


others, the Protestant and the Catholic stu- 
dents excluded the Jewish students signifi- 
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cantly more than the Jewish students excluded 
them. 

When the guesses made by the groups were 
compared with the choices they received from 
other groups, it was possible to investigate 
the group’s recognition of the biased choos- 
ing. In general there is evidence that groups 
have good insight concerning the social re- 
sponse they receive from their own members 
and from members of other groups. Protes- 
tant students closely estimate the response 
from members of their own group and that 
from the Catholic students, but they signifi- 
cantly underestimate the response from the 
Jewish students. Catholic students closely 
estimate the response from members of their 
own group, and that from the Protestant 
students, but they underestimate the response 
from the Jewish students. Jewish students 
closely estimate the response from members 
of their own group and from the Protestant 
and Catholic students. 
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FORECASTING OFFICER POTENTIAL USING THE 
LEADERLESS GROUP DISCUSSION * 
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use of the leaderless group discussion 
technique to assess candidates for lead- 
ership positions have disclosed that while 
many industrial, military, and government 
agencies are making use of the procedure, few 
validity studies are available (1, 3, 4). How- 
ever, three published validity studies have 
reported similarly positive results. Correla- 
tions of .36 and .21 were obtained between 
discussion scores and ratings of success as 
foremen in two federal shipyards. The lower 
than expected validities were attributed 
partly to the unreliability of the criterion (7). 
A correlation of .44 was found between lead- 
erless discussion behavior and previously ob- 
tained buddy ratings of leadership status in a 
college fraternity (6). A third study (8) ob- 
tained an average correlation of .36 between 
ratings obtained during discussion and vari- 
ous criteria concerning performance as a civil 
servant obtained as much as two years later. 
In addition, a research study subsequent to 
the surveys (10) yielded an average correla- 
tion of .47 between scores based on leaderless 
discussions among fraternity pledges, each 
representing a different fraternity, and their 
leadership status six months later in their re- 
spective fraternities. 


St of the literature concerning the 


Tue ProspL—EM 


The present study was another attempt 
to measure the extent to which leadership 
potential could be forecast by means of 
the leaderless group discussion technique. 
Specifically, this study was made to indicate 
the extent of relationship between the tend- 
ency to attain leader status in initially lead- 


1The authors wish to express their appreciation to 
Col. N. C. Pilet (formerly PMS&T, LSU), Col. F. T. 
Berg (PMS&T, LSU), Col. R. P. Johnson (PAS&T, LSU) 
and Maj. G. E. Swindell (Personnel Officer, ROTC, 
LSU) without whose interest and cooperation this study 
would not have been possible. 

A condensed version of this article was read at the 
meeting of the American Psychological Association, Chi- 
cago, September, 1951. 
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erless group discussions and rated potential 
as a future Army or Air Force officer. 


Tue AssessMENT PRocEDURE 


Third year Advanced Reserve Officer 
Training Corps cadets frora 11 military 
classes at Louisiana State University, a semi- 
military school, were divided randomly 
(within classes) into groups of six, seven, or 
eight members. Thirty-five groups compos- 
ing 244 cadets were organized and each was 
subjected to a thirty-minute initially leaderless 
group discussion. Instructions and procedure 
for the LGD were similar to those adminis- 
tered in studies reported previously by the 
senior author and others (2, 5, 6). A definite 
problem for discussion, either technical or 
general, was assigned each group. An ex- 
ample of a technical problem was: 

The Cadet Corps is the only military force avail- 
able to defend the Mississippi River Bridge against 
a regiment of enemy paratroopers who are now 
dropping in St. Francisville. The only weapons and 
equipment available are those now on hand. You 
may organize civilians to help. In the next thirty 
minutes draw up a plan to defend the bridge for 
three days until expected reinforcements arrive. 


A general problem required the participants 
to enumerate the characteristics of a good 
officer; or, to select the outstanding military 
leader of the present and so forth. 

The observers (Os) used a nine-item rat- 
ing schedule? and indicated by a point sys- 
tem the extent to which each candidate ex- 
hibited the behaviors noted in the schedule. 
From an analysis made during a previous 
study (6), it was inferred that the highly 
interrelated items all concerned the extent 
to which each examinee attained leadership 
status in the thirty-minute discussion, i.¢., ex- 
hibited behavior which resulted in progress 
of the group in a shared direction. 

Each discussion was observed by two 
trained raters. Six graduate students in psy- 


2 For a description of the nine items, the reader is 


referred to (5). 
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chology, an Army lieutenant colonel and an 
Army major served in pairs as Os of one or 
more of the 35 group discussions.’ All Os 
were given several practice sessions. Exam- 
inees were oriented and instructed concern- 
ing the nature of the discussions and mo- 
tivated to perform maximally by the junior 
author, a retired regular Army colonel. It 
was felt that for this and other reasons moti- 
vation was high among cadets to do well 
on this “test.” 

Examination of the data disclosed that Os’ 
standards varied somewhat despite the prac- 
tice which had been given them. Therefore, 
ratings by each O were converted to standard 
scores. A participant's final rating (Z,), indi- 
cating the extent to which he attained leader- 
ship status in the initially leaderless group 
discussion, was the sum of the two standard 
score ratings assigned him by the pair of Os 
of his discussion. 

Although some subjects (Ss) were tested in 
six-man groups, while others were tested in 
seven- or eight-man groups, no correction for 
group size was necessary in the ratings they 
received since the effects were negligible 
in this particular instance because of such 
small variations in group size.‘ 


Tue CRITERIA 


As criteria of officer potential, the investi- 
gators used the mean rating given each cadet 
by from 10 to 20 cadet and tactical officers 
based on two years’ acquaintanceship in 
which many raters shared the same barracks 
with the ratees. The ratings were made ac- 
cording to a descriptive rating schedule for 
which directions for Army personnel were as 
follows: 

On the reverse hereof is a roster of all cadets (of 
a designated class). A rating in leadership is de- 
sired on each of these cadets whom you feel that 
you know well enough to judge fairly. In arriving 
at a rating consider such factors as forcefulness, ini- 
tiatit and military bearing, and atten- 
tion to duty. To arrive at the figure that represents 
your opinion use the following scale of values: 

Considering that some day this cadet may serve 


, 4ppearance 


8 Four pairs of Os were needed since many of the 


discussions were conducted simultaneously. 
*For an analysis of the effects of group size on 
leaderless discussion ratings the reader is referred to (5). 
5 The investigators had no control over these ratings 
which are standard operating procedures of the ROTC 


staff for constructing merit lists each year. 
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under you as a junior officer, your opinion is that 
you would: Fight to get him (40 points); Prefer 
him to most (20 points); Be happy to have him 
(10 points); Take a chance on him (5 points); Do 
not want him (o points). 


The Air Force rating procedure was sim- 
ilar to the Army technique except that it 
substituted a forced distribution rating sched- 
ule for the one used by the Army. The per- 
centage of Air Force cadets to be designated 
with a particular label were specified as fol- 
lows: 

Superior (5 per cent—4o points); Excellent (20 
per cent—30 points); Very Satisfactory (50 per cent 
—-10 points); Satisfactory (20 per cent—5 points); 
Unsatisfactory (5 per cent—o points). 

Raters were instructed to work independ- 
ently of other raters. 

The first set of criterion ratings was ob- 
tained in May 1950 on 100 of the Army 
cadets and 55 of th: Air Force cadets ® just 
prior to their entrance into Advanced ROTC 
and about six months before the administra- 
tion of the leaderless group discussions. A 
second set of criterion ratings was obtained 
in May 1951 for all but 7 of these same 
cadets.” 

Eighty-five of these cadets in the Army 
classes went to Infantry, Field Artillery, En- 
gineers, or Ordnance training camps during 
the summer of 1951. At the end of this time, 
their respective summer camp tactical off- 
cers rated the cadets using the same scales as 
used by the on-campus tactical officers. These 
ratings provided a third set of criterion rat- 
ings. 

There is evidence to indicate that immedi- 
ate criteria similar to the type used in this 
study are predictors of success in Officer 
Candidate School and are heavily loaded 


available for only 155 of the 
examined using the LGD 


6 Criterion data were 
244 cadets who had been 
because only these 155 had served in Basic ROTC at 
Louisiana State University. The others were veterans 
for whom Basic ROTC was not required or 
transfer students. The mean LGD scores of veterans 
and transfer students was only .og¢ below that of the 
mean of students who had served in Basic ROTC at 
this University and for whom criterion data were 
available in May 1950. 

7 Although criterion data were available in May 1951 
on almost all the 244 examinees, it was decided to 
analyze only the 1951 data for the 148 cadets also on 
the May 1950 criterion lists in order to keep constant 
as many conditions as while studying the 
changes in validity of the LGD, if any, from one 
year to the next. 


were 


possible 
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with a “leadership factor” (9). Since many 
future administrative actions such as grant- 
ing Regular rather than Reserve Com- 
missions will depend on these ratings, they 
also constitute an appraisal of future status 
as an officer. 


ASSESSMENT AND CRITERION RELIABILITIES 


Table 1 indicates the extent of agreement 
among the various pairs of observers used in 
this study in their standard score ratings of 
the extent to which the examinees attained 
leadership status in the 35 initially leaderless 
discussions. These correlations, when stepped 
up by the Spearman-Brown prophecy for- 
mula, provided estimates of the reliability of 
the leaderless discussion assessments (Z,). 
Raters 1 and 2 were regular Army officers. 
The others were graduate students in psy- 
chology. No particular military-civilian dif- 
ferences were discernible. The weighted 
average reliability as estimated by the aver- 
age correlation between pairs of raters (using 
Fisher’s z conversion) was .83. The weighted 
average corrected reliability of the assess- 
ment variable (Z,) was .gt. 

A test-retest LGD reliability of .53 was 


obtained when Ss were retested a year later 
in rearranged groups using different Os. 
The rate-rerate reliability of the criterion 
(as estimated by correlating the mean ratings 
received by cadets in May 1950 with those re- 
ceived one year later) was .64 for the Army 


TABLE | 


Extent oF AGREEMENT BETWEEN Pairs OF OBSERVERS 
Ratinc THE AMouNT oF LEaper Status ATTAINED 
By EXAMINEES DURING INITIALLY LEADERLESS 
Discussions 








EsTIMATED 
RELIABILITY 
OF Zu 


No. AGREEMENT 
EXAMINEES BETWEEN 
RATED OBSERVERS 


OBSERVER 
Pair 





70 86 -92 
78 79 . 88 
48 .80 89 
20 71 -83 
8 -90 -95 
14 -9I -95 
6 -98 -99 


AvVnNnN Uw & 


244 .83° .g1* 


= 





* Weighted average obtained by using Fisher's z 


conversion. 
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and .50 for the Air Force. These coefficients 
reflected three sources of error in the criter- 
ion ratings—the lack of agreement among 
raters (many of whom were different in 1951 
from those in 1950), the lack of consistency 
in the raters from one year to the next, 
and changes in the perceived leadership po- 
tential of the ratees from May 1949 to May 
1951. 

The criterion reliabilities were higher than 
those obtained in OCS (9) probably because 
in the present study the first and second eval- 
uations were made after a much longer 
length of acquaintanceship between raters 
and ratees. 


VaLipiry oF THE LGD 


Table 2 shows the correlations obtained for 
the 7 Army and 4 Air Force classes between 
the extent to which examinees attained lead- 
ership status during one of 35 initially lead- 
erless group discussions (Z,) and their rated 
leadership potential as indicated by the 1950 
and 1951 criterion ratings. The number of 
examinees per class included in these analyses 
is also shown. 

The weighted average class-by-class valid- 
ity for the Army classes (using Fisher’s z 
conversion) was .41 in 1950 and .37 in 1951. 
For the Air Force, the corresponding validi- 
ties were .48 and .46. When all Army sub- 
jects were pooled into a single sample, the 
resulting validity was .32 for 1950 and .31 for 
1951. For the Air Force subjects the cor- 
responding pooled sample validities were .40 
and .42. The summary validities are shown 
in Table 3 corrected for attenuation due to 
the unreliability of the criteria. The squares 
of the summary validity coefficients are esti- 
mates of the proportion of variance in lead- 
ership potential among future officers which 
could be accounted for by the LGD if leader- 
ship potential was constant from year to 
year and raters were in perfect agreement in 
appraising such a factor. 

Although the criterion ratings changed 
substantially from one year to the next, the 
validities as a whole did not. It would seem 
that the LGD was measuring some stable 
component of rated leadership potential but 
what this stable component is remains un- 
known. 


The validities appeared to be higher for 
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TABLE 2 


CorRELATION BETWEEN LeapersHtp Status Atrainep Durine Inrriacty Leapertess Discussions (Z,) AND 
Ratep Orricer Porentia, ror Two Successive YEARS AND 1951 SUMMER CaMP 








NuMBER oF Cases 


VALIDITIES 








1950 


1950 





Field Artillery 

Engineers 

Ordnance 
Army Total 


Air Force 
Administration 35 
Aircraft Maintenance 20 19 
Air Force Total 55 52 


+33 
+24 
44 
-45 
«32° 


-39 
+50 
-40° 





* Correlations based on the pooled classes or total number of cases. Against the 1950 criterion, for the Army, 


the weighted mean class validity was .41 using Fisher’s z conversion. 
the 1951 criterion, the corresponding validities were .37 for the Army and .46 for the Air Force. 


For the Air Force, it was .48. Against 
Against the 


summer camp 1951 criterion for the Army only, the corresponding validity was .40. 


the Air Force samples than for the Army 
samples, however these differences were prob- 
ably fortuitous. 


SUMMARY AND CONCLUSIONS 


Army and Air Force third year ROTC 
cadets drawn from 11 classes at a semi-mili- 
tary college participated in one of 35 initially 
leaderless discussions in a group of 6, 7, or 
8. Pairs of trained Os estimated the extent 


TABLE 3 


Pootep Crass AND AveRAGE Crass CORRELATIONS BE- 
TWEEN LeapersHip Status Atrainep Durine INt- 
TIALLY LEApERLEss Discussions (Z,) AND RaTED 
Orricer PorentiaL For Two Successive 
Years aND Army SUMMER CAMP 1951, 
Correctep For ATTENUATION DuE To 
THE UNRELIABILITY OF THE CRITERIA 

or Orricer PoTEeNTIAL 





WEIGHTED 
AVERAGE 
Crass VaLipiry 


PooLep 
Crass VALipiTy 





Arm AIR 


Year ARMY Force Army Force 





1950 .56 -51 .68 
1951 4 59 44 65 
1951 Summer Camp ° -- -53 -- 
Average for 

two years , .58 .48 .66 
Average for 

two years & 

summer camp 





to which each S attained leadership status 
during the given discussion in which the S 
participated. These estimates were correlated 
with one set of standardized military ratings 
of officer potential of the participants made 
six months previously by tactical officers and 
cadet officers and another set made six 
months after the LGD administrations. Rat- 
ings based on performance in different Army 
camps were also collected. Correlations be- 
tween LGD scores and criteria were obtained 
for small classes of Infantry, Field Artillery, 
Engineers, Ordnance, and Air Force cadets. 
A weighted average of these validities based 
on two years’ criterion ratings of leadership 
potential (corrected for attenuation due to 
criterion unreliability) was .48 for the Army 
cadets and .66 for the Air Force cadets. Cor- 
responding pooled sample validities were 
slightly lower. 

The validities reported in this study rep- 
resent another verification of the hypothesis 
that the attainment of leadership status dur- 
ing initially leaderless discussions is predictive 
of leadership status and leadership potential 
outside of the discussion. Since most of the 
examinees in these samples receive Regular 
or Reserve Army and Air Force commis- 
sions upon completing training, the validi- 
ties reported in this study actually may be 
underestimates of the forecasting abilities of 





Forecastinc Orricer PorentiaL Usinc THE Leapertess Group Discussion 


the LGD, if it is used to screen applicants 
for OCS or ROTC. 
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ANY important questions about per- 
M sonality can scarcely be raised with- 

out considering its relationship to 
inter- and intrapersonal conflict. Very ob- 
vious for example are the implications of all 
those clinical observations, particularly ob- 
servations on children during their formative 
years, which relate personal insecurity and 
perhaps inadequacy to intrafamilial conflict. 
In such studies the personality as an objective 
fact occupies the center of attention, and 
scientific work proceeds along the classic 
lines of observing how the properties of the 
personality as an objective entity are depend- 
ent upon external circumstances. 

There is, however, another sense in which 
personality can be considered in relation to 
conflict, and this sense is one which, to the 
authors’ knowledge, has hitherto escaped 
systematic investigation. Personality may be 
regarded primarily as an impression which 
is formed by another person; from this view- 
point questions may be raised as to the way 
in which such an impression is affected by 
the presence of interpersonal conflict. In 
work of this kind, the dependent variable of 
the investigation is not the personality of a 
person, x, but rather the impression which 
another person, y, forms of x’s personality. 
Or it may be the impression which x forms 
of his own personality. 

The distinction between personality as an 
impression, and personality as an objective 
fact is not original with us; e.g., it has often 
been used by practitioners in the professions 
of social work, clinical psychology, and psy- 
chiatry, many of whom often appear to be 

1 This investigation has been supported (in part) 
since 1947 by a research grant from the National Insti- 
tute of Mental Health, of the National Institutes of 
Health, Public Health Service. Im addition, help is 
acknowledged from funds received in 1946 from the 
University of Pennsylvania School of Medicine and from 


funds received in 1943-1945 from the Johnson & John- 
son Research Foundation. 
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persuaded that the important questions to be 
raised about the personality of another per- 
son have to do with the impression, or as it 
is often said, the feelings, which it excites in 
another, rather than with facts of the objec- 
tive status of that personality. At the level 
of theory, the distinction has recently been 
the subject of very extensive treatment by 
Ichheiser (3), who finds an important, if not 
the decisive, locus of misunderstandings in 
human relations in the distinction. An ob- 
jective of the present investigation is to deal 
with a certain kind of conflict as a factor 
which conditions impressions of personality 
in such a way as to affect the frequency or 
the intensity of misunderstandings in human 
relationships. As such, a fruitful opportunity 
is presented for the extension of Ichheiser’s 
thinking at the experimental level. 

In the present work a conflict situation is 
understood as being a situation which divides 
persons into opposing camps. In such situ- 
ations it is intrinsic that “sides” be formed, 
and that persons in the situation be either on 
the same “side” or on different “sides.” 
Where people are divided into “sides” the 
following propositions hold: (a) by defini- 
tion, the two sides are striving for mutually 
exclusive goals; (4) since the behavior is 
goal directed and hence subject to frustration, 
it will necessarily (because of the existence of 
the mutually exclusive goals) be emotional 
in varying degrees, being dominated at times 
by hopes and fears, and dependent upon the 
importance of the goals and the imminence 
of frustration; (c) the hopes and fears of the 
persons on each side will be reciprocally re- 
lated—events which are potential sources of 
frustration to the one side will excite hope in 
the other; and (d) in the light of the fore- 
going fact, people on different sides will 
exhibit different behavior to the same event. 
In addition, it is not difficult to believe that 
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people in such a conflict situation will use 
different frames of reference in rationalizing 
the behavior of others, depending upon 
whether they are dealing with the behavior 
of a person on their own side or on the other 
side. All of this leads to strong suspicions 
that in conflict situations one finds a fruitful 
opportunity to investigate problems sug- 
gested by Ichheiser’s work. 

A good example of interpersonal relation- 
ships in which individuals will be found to 
occupy opposing positions on each of several 
issues is the marriage relationship, particu- 
larly when it is troubled. For this reason, 
data from marriage partners offer an especial 
opportunity for the study of the effects of 
conflict upon the impressions of personality. 
The present investigation, coming from a 
marriage counseling agency, is based upon 
such partners. 

Psychological work on the personality rat- 
ings of marriage-partners (either of self or 
of partner) represents the end-product of 
interest in the problem of homogamy (i.e., the 
problem of the likenesses of marriage part- 
ners). Such interest has found expression in 
studies of the intelligence of spouses (6), of 


the personalities of partners as indicated by 
scores on inventories (2), and finally of such 
personalities as indicated by ratings of the 


self and of the partner (5). All of these 
studies have been directed to the fundamen- 
tal question of homogamy—are marriage 
partners more alike or more different than 
pairs drawn at random from the general 
population? Of those who have worked 
with ratings, Kelly (5) has come closest to 
dealing with them purely as impressions, 
rather than as indices of the objective status 
of the personality from which conclusions as 
to homogany could be formed. 

Kelly’s work is based on the four ratings 
which are obtained when a husband and a 
wife each rate himself and his partner. The 
work reported herein is based upon similar 
ratings, although the treatment given the rat- 
ings differs materially from Kelly’s treatment 
of his data. The four ratings obtained from 
experiments of this kind include the hus- 
band’s self-rating (H,), the husband’s rating 
of his wife (H,,), the wife's self-rating (W,), 
and the wife’s rating of her husband (W,). 
Kelly has reported all differences between 
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arithmetic means which can be taken from 
the four distributions as he obtained them 
from a sample of 76 marriages. On the basis 
of this work he reports no differences in the 
mean ratings when H, and W, ratings are 
compared, or when H, and W, ratings are 
compared. Differences are observed, he re- 
ports, when H, and W, ratings are compared 
(the husbands’ self-ratings are less favorable 
than the ratings of the husbands by the 
wives) when W, are compared with H, 
(wives’ self-ratings are less favorable than the 
husbands’ ratings of the wives) when H, are 
compared with H,, (the husband’s self-ratings 
are less favorable than the husbands’ ratings 
of the wives) and when W, are compared 
with W, (the wives’ self-ratings are the less 
favorable). 

These facts suggest that the self-ratings are 
generally less favorable to the subject of. the 
rating, since in all comparisons where dif- 
ferences exist, the self-ratings suffer in com- 
parison with the ratings of the other person. 
A less obvious conclusion, but a logical de- 
duction from the data, is that discrepancies 
exist in the foregoing comparisons which 
cannot be accounted for by the existence of 
systematic differences in the personalities ob- 
jectively defined. Results in which the hus- 
band’s rating is both less than the wife’s (H, 
is less than H,,) and greater than the wife’s 
(W, is usually under W,) show that these 
are not completely valid judgments of differ- 
ences in personality objectively defined. 

A better test of the validity of the ratings 
which Kelly studied can be made by investi- 
gating certain of their intercorrelations which 
were not used in his work but which repre- 
sent the core of the technique in the work 
described here. Any investigation which 
produces the four distributions of ratings un- 
der discussion (H,, H,,, W,, and W,) makes 
possible the calculation of six coefficients of 
correlation. The six coefficients fall into four 
classes: 

1. Correlations between ratings of self and 
of partner. Of this class there are two mem- 
bers, (2) H, and H,, and (4) W, and W,,. 
From such coefficients of correlation we can 
determine the extent to which either hus- 
bands or wives rate themselves similarly to 
their partners. 

2. Correlations between the spouses’ self- 
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ratings. Of this class there is but one mem- 
ber, i.¢., the correlation between H, and W,,. 
From it we can conclude as to the extent of 
similarity in such self-ratings. 

3. Correlations between the spouses’ ratings 
of each other. Of this class there is but one 
member, i.e., the correlation between H,, and 
W,. No special importance is attached to this 
coefficient in the present investigation. 

4. Correlations between self-ratings and 
ratings of oneself made by the spouse. Of 
this class there are two members, i.e., the 
correlation between H, and W,, and the 
correlation between W, and H,,. No special 
importance is attached to this class of coefh- 
cient in the present investigation, although 
a later publication will reveal that such co- 
efficients are members of a larger class, i.e., 
the class of all events on which husbands and 
wives can be asked to make a simultaneous 
report, which has important significance. 

It should be pointed out that the foregoing 
analysis exhausts all possibilities—no other 
intercorrelations between the ratings exist. 

The coefficients in Class 2 above must be 
contrasted with the two members of Class 
1 if their full significance is to be perceived. 
In the first class, questions are raised about 
the extent to which the same person rates his 
own and another person’s personality as being 
similar. In the second class, questions are 
raised about the extent to which these two 
people entertain similar views of themselves. 
If both coefficients of the first class are high 
while the second class coefficient is low, it is 
signified that the husband rates himself and 
his wife similarly, as does the wife. At the 
same time it is signified that the two spouses 
each have a different view of the aspect of the 
personalities under assessment. The latter 
fact is of major significance in the theoretical 
development of this investigation. It is taken 
to signify that either one or the other or both 
of the partners are systematically in error in 
their ratings; i.c., that one or the other or 
both of the ratings are being influenced by a 
factor other than the personality under as- 

2 
sessment.” 

The specific objective of the present re- 
search is to be reached by making systematic 

2 The results will also have an important bearing upon 


the problem of the validity of such ratings as predictors 
of personality objectively considered. 
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examination of the coefficients of correlation 
in Classes 1 and 2 as just described, as made 
by spouses known to be troubled in one or 
more aspects of their marital relationship, and 
as made by spouses concerning whom no 
evidence available to the investigators indi- 
cated that they were troubled in their mar- 
riages. The reaching of such an objective is 
worth while because it permits consideration 
of (a4) a problem of general interest in con- 
temporary social science—the problem of the 
interrelationships between our opinions of 
others, our attitude towards them, and our 
common relationships to conflict, and (4) a 
problem of specific interest to those who must 
work with problems of marital strife and dis- 
cord. For the latter, there is hope for the 
development of techniques of diagnosis and 
counseling from systematic knowledge about 
the way in which impressions of personality 
are related to the experience of marital con- 
flict. In addition, it is an objective of the 
present experiment to evaluate the validity of 
ratings of personality, as such validity relates 
to personality as an objective entity, and 
thereby to contribute to the criticism of con- 
clusions reached hitherto on the problem of 


homogamy in the area of personality, based 
on such ratings. 


METHOD 


The present investigation was carried out at the 
Marriage Council of Philadelphia, a private social 
agency offering counseling service to persons con- 
templating marriage and to married persons. Since 
1943 the agency has been collecting data systemati- 
cally for research purposes.8 

All clients at the Council routinely furnish infor- 
mation to the counselor by filling in schedules prior 
to or during counseling. Among these schedules 
is one, known as the Marriage Schedule, which 
inquires into a variety of details of the marital 
experience of the clients. A part of this schedule 
is concerned with the client’s self-appraisal of his 
personality and with the client’s appraisal of the 
spouse’s personality. The appraisal is made by use 
of a three-category rating scale, the categories being 
Considerable, Little, or Not at All. The scale is 
applied to each of 17 aspects of the personality. 
The aspects of the personality used are among those 
used by Burgess and Cottrell (1) and Kelly (5). 
They are listed in Table 2. 

Clients who are counseled premaritally cannot, 
in the nature of things, return information on their 
marital experience at the time of the counseling. 


8 The authors acknowledge with grateful appreciation 
the help of Dr. Ernest W. Burgess who participated in 
the framing of certain of the schedules used in this 
investigation. 
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Many of these clients, however, are in touch with 
the agency during the year to a year and a half 
following their marriage, and at such times fill out 
the Marriage Schedule, which contains the per- 
sonality ratings. Clients who were counseled pre- 
maritally and who on return to the agency after 
marriage were counseled with respect to a marital 
problem were not used in this study. 

In many instances involving premarital as well 
as marital counseling, both partners are under 
counseling at the agency, in which case Marriage 
Schedules are available from both partners. In 
those cases in which only one spouse is counseled, 
the other spouse is often asked to fill in the sched- 
ules; this request is honored in many cases. 

At the beginning of the present investigation 
(July 1940), the files of the agency were searched 
in order to produce all instances in which both 
spouses had filled in the Marriage Schedule con- 
taining the personality ratings, together with a 
schedule pertaining to experience in the engage- 
ment period. This search yielded 116 pairs of cases 
in which either one or both spouses were counseled 
postmaritally and 55 pairs of cases in which either 
one or both spouses were counseled premaritally 
(none having been counseled postmaritally), all 
spouses having filed the critical schedules. 


Characteristics of the Samples 


The two samples differed in many respects, 
including marital history, duration of marriage, 
age of the partners, length of time the respondents 
reported themselves as being unhappy, extent of 
confiding reported to occur in the marriage rela- 
tionship, degree of satisfaction with affection re- 
ported by the partners, methods used in resolving 
disagreement, happiness of their parents as reported 
by the respondents, happiness in teens as reported 
by respondents, and personality attractiveness scores 
developed from the personality ratings. 

Most of the clients counseled postmaritally had 
been seen in connection with problems which, while 
they differed in seriousness and complexity, consti- 
tuted a difficulty either in the personal adjustment 
of the person under counseling or in the marriage 
relationship. These problems are currently under 
investigation and will be reported at a later time. 
None of the persons counseled premaritally revealed 
marital problems for which they asked counseling 
at the time the schedule was filed (which, of course, 
was after they had married). 

As to the remaining differences (excepting the 
differences in personality attractiveness scores), 
Table 1 gives the relevant statistics. It is evident 
from the table that the postmaritally counseled 
group was comprised of persons whose marriages 
were of longer duration, who were older, who 
reported themselves as unhappy for periods in 
excess of a year (on the average), who were more 
likely to report that confiding was relatively absent 
in the marriage, who were more likely to report 
dissatisfaction with the amount of affection in the 
marriage, who were less likely to report that dis- 
agreement was resolved by mutual give and take, 
whose parents were more likely to be reported as 
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being unhappy in their marriage, and whose own 
teen ages were more likely to be reported as 
unhappy. It is also evident from Table 1 that 
minor differences exist in degree of education 
(though small, one of these is statistically signifi- 
cant), in the extent of religious activity reported, 
and in the number of children reported in the 
participants’ families. 

Table 2 gives the mean and standard deviations 
of the eight distributions of personality-attractive- 
ness scores developed from the ratings of self and 
partner by each of the two spouses in each of the 
two experimental groups. These scores were ob- 
tained by combining the personality ratings (as 
listed in Table 3) into scores by arbitrarily weight- 
ing the alternatives (much, some, or little) on each 
trait by assigning o to the most attractive of them, 
1 to the next most attractive, and 2 to the least 
attractive alternative. A raw personality-attractive- 
ness score consisted of the sum of the numbers thus 
assigned to the traits. In order to equalize scores 
which differed because an occasional respondent 
failed to rate all traits, the score used in the analysis 
was the percentage of the possible score over those 
traits which were rated. Since each person rated 
himself and his spouse, each person produced two 
scores, and each marriage produced four. In using 
Table 2 it must be remembered that the larger the 
score the less favorable the judgment. The table 
indicates clearly that all ratings, both of self and of 
other, suggest that the postmarital group is reported 
as having less attractive personalities than the pre- 
marital group. Comparing means on like distribu- 
tions (e.g., husband’s report on himself), all four 
differences are reliable at less than the .or level.5 


*In order to avoid making arbitrary decisions as to 
whether a rating of “much” on each of the traits listed in 
Table 3 denoted an attractive or unattractive personality 
trait, all traits used on the schedule were distributed to 
approximately 40 married graduate students and faculty 
members at the University of Pennsylvania with a request 
to check each of the traits as to whether a judgment of 
“much” denoted an attractive or wumattractive trait. 
These judges were unanimous or just short of unanimous 
in their reports on 14 of the 17 traits. The 14 traits 
(stated in the framework of personal attractiveness) 
were: does not anger casily, gets over anger quickly, 
takes responsibility willingly, not stubborn, not selfish, 
sense of humor, not easily hurt, makes friends easily, not 
easily influenced by others, docs not act impulsively, not 
easily depressed, easygoing, not easily excited, not 
jealous, and punctual. 

5 It may be of interest to the reader also to note that 
in the postmaritally counseled group, the men are more 
severe on their wives than they are on themselves, 
while the wives are more severe on themselves than they 
are on their husbands. The same tendency is seen in 
the data of the premaritally counseled wives (but not in 
the data of the premaritally counseled husbands). In 
both distributions the wives are more severe on them- 
selves than the husbands are on themselves. In both 
distributions the men are more severe on the wives than 
the wives are on the husbands. The present data con- 
firm Kelly's results, previously described in part, but 
do not confirm that conclusion which states that re- 
spondents are generally more severe on themselves than 
they are on others. The latter conclusion fails with both 
groups of males in the present investigation. 
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TABLE 1 


DIFFERENCES BETWEEN EXPERIMENTAL GROUPS 








CouNSELED CouNsELED 


ITEM 
PosTMARITALLY PREMARITALLY 


ConFIDENCE LEVEL 
oF DIFFERENCE 





Duration of marriage (years) Mean 5. Mean 
SD 4- SD 
Age of participants (years) 
Husbands Mean 31. Mean 
SD 6. SD 
Wives Mean 28. Mean 
SD 6. SD 
Length of time reported 
unhappy (years) 
Husbands ‘ Not 
, Applicable 
Wives . Not 
Applicable 
Percentage reporting that 
confiding took place 
on few things 
Husbands 


Wives 
Percentage reporting lack 
of satisfaction with degree 
of affection 
Husbands 


Wives 
Percentage reporting resolving 
disagreement by mutual give 
and take 
Husbands 
Wives 
Percentage reporting marital 
unhappiness of own parents 
Husbands 
Wives 
Percentage reporting own 
unhappiness during teens 
Husbands 23 
Wives 23 13 
Education (years) 
Husbands Mean 14. Mean 15. 
SD 3. SD . 
Wives Mean 14. Mean 15. 
SD 3. SD I. 
Percentage reporting active 
or very active in re 
ligious attendance 
Husbands 
Wives 
Number of children in 
own family 
Husbands 


Wives 


<.01 


<.01 


<.o1 


Not 
Applicable 
Not 
Applicable 


Not 
Computed 


“ 


Not 
Computed 








IMPRESSIONS OF PERSONALITY AS 


TABLE 2 


Frequency DistrisuTions oF PersoNnaLity- 
ATTRACTIVENESS SCORES 








PosTMARITAL GROUP 


Me W, 








47 44 
16 15 





PREMARITAL GRouP 





Mean 27 27 33 
SD 13 12 11 





The foregoing account of the differences be- 
tween the groups indicates that these differences 
are four in number. First, on the basis of their 
histories, as they were known to the agency, and 
on the basis of their reports to relevant items of 
the marriage schedule, it is evident that the post 
marital group contained many people whose mar- 
riages were less than a satisfaction to them, while 
the premarital group when examined after mar- 
riage contained few if any such persons. Second, 
the postmarital group contained a larger proportion 
of people who reported unhappiness in other con- 
texts than their marriage, i.c., in their own teens 
or in their parents’ marriages. Third, the post- 
marital group was older and had been married 
longer than the premarital group. Finally, both 
husbands and wives in the postmarital group rated 
themselves and their spouses in such a way as to 
produce personality attractiveness scores which 
differed significantly from those of the premarital 


group. 
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REsuULTs 


Correlations of personality ratings made by 
the two sets of spouses in the present investi- 
gation were studied trait by trait as well as 
by the personality-attractiveness score previ- 
ously described. For each measure (rating or 
score), a measure of statistical dependence ® 
was calculated between (a) the husband’s 
self-rating (H,) and his rating of his wife 
(H,,), (8) the wife’s self-rating (W,) and her 
rating of her husband (W,), and (c) the 
husband’s self-rating (H,) and the wife’s 
self-rating (W,). 

Table 3 lists the various traits and gives the 
coefficients of mean square contingency for 
each of the 17 traits. The table also gives the 
median coefficient of each type, over the 17 
traits, and the product-moment coefficients 
based upon the previously described person- 
ality-attractiveness score. The latter coeffi- 


cients constitute a check upon the coefficients 
calculated from the ratings on the individual 
traits. The symbols H,H,, denote the corre- 
lation between H, and H,, as previously de- 
fined. The remaining column headings are 


® Coefficients of mean square contingency were com- 
puted when the traits were studied individually, and 
Pearsonian product-moment coefficients were computed 
when the personality-attractiveness scores were under 
consideration. 


TABLE 3 


CoEFFICIENTS OF MEAN Square CoNnTINGENCY BAsED UPON PersonaLity Ratincs Mape sy Boru Spouses 
IN THE Two ExpeRIMENTAL Groups 








CouNSELED PosTMARITALLY 


CouNSELED PREMARITALLY 





Trait 
H.iHe 


H.W, 


WeWr H.wHe H.W, WoW 





Angers easily -39 
Gets over anger quickly oa? 
Takes responsibility willingly -24 
Stubborn -55 
Selfish -57 
Dominating -41 
Sense of hurnor -40 
Easily hurt -45 
Makes friends easily -14 
Likes belonging to organizations +33 
Easily influenced by others .40 
Acts impulsively -28 
Easily depressed -37 
Easy-going -14 
Easily excited -29 
Jealous a 
Punctual -39 
Median -37 
Attractiveness scores 42 
(Product-moment coefficients) 


-17 -35 -52 «19 -39 


-14 -36 A -33 -62 


-14 -14 ‘ -30 -76 


+22 -52 ‘ -49 .66 


-25 -39 : -39 -37 
37 -37 ‘ -17 -39 
-10 -45 ‘ +33 73 
-10 -45 ‘ -25 -57 
~24 8 -6 -29 -29 
-33 41 ‘ -34 +25 
-17 31 ‘ 25 -41 
-14 087 ‘ -19 -39 
-19 -31 , -34 -45 


-24 33 P -30 -59 
-19 -25 ‘ +41 -41 


-22 -42 K -55 -73 


+27 +29 . -17 -62 
-19 -35 . +30 +57 
-19 -33 +15 -63 
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defined in accordance with the same scheme. 
The following facts are evident in Table 3. 

1. In both groups association exists between 
the spouse’s report of himself and his report 
of his partner. Four coefficients are relevant 
to this statement, i.e. those in the first, 
third, fourth and sixth columns. From the 
medians and the _personality-attractiveness 
scores spouses tend to rate self and partner 
similarly in personality traits. 

2. Spouses counseled premaritally show 
more association between their views of them- 
selves and of their partner than do the post- 
maritally counseled spouses. 

3. In both groups, the lowest degree of as- 
sociation exists between the self-ratings of the 
spouses. These coefficients appear in columns 
2 and 5. The rating which each person makes 
of himself possesses little similarity to the 
self-rating of his spouse. 

A critical illustration may clarify the mean- 
ing of the foregoing facts. The facts suggest 
that of two spouses, one may rate himself and 
his partner as possessing a very acute sense of 
humor, while the other reports that both 
possess very little sense of humor. In each 


case the pair will contribute positively to any 


correlation between a self-rating and a rating 
of the partner. At the same time, they will 
contribute negatively to the correlation be- 
tween the self-ratings. 

4. No material difference exists between 
the two groups in the extent of the similarity 
in the self-ratings. The relevant values are 
19 and .30 (medians) and .19 and .15 (per- 
sonality-attractiveness scores). 

The statistical reliability of the results sup- 
porting the foregoing statements is very high. 
The results of the investigation of this aspect 
of the problem are as follows: 

1. Tendency for ratings of self to correlate 
with ratings of spouse. All 66 of the coefh- 
cients of mean square contingency are posi- 
tive in sign. When the frequency is 116, the 
product-moment coeflicient at the .o1 level is 
less than .20. Both coefficients based on the 
postmarital group exceed this quantity. 
When the frequency is 58, the product-mo- 
ment coefficient at the .or level is less than 
.40. Both coefficients based on the premarital 
group exceed this amount. 

2. Tendency for ratings of self and spouse 
to correlate higher than self-ratings. At the 
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level of the individual trait this question can 
be raised on the basis of 68 comparisons, In 
61 instances the fact is observed. Four tests 
can be made of the question using the prod- 
uct-moment coefficients. The two tests based 
on the postmarital group indicate significance 
at the .o5 and .20 levels. The two tests based 
on the premarital group each indicate sig- 
nificance at less than the .o1 level. 

3. Tendency for ratings of self and spouse 
to correlate higher for premarital than post- 
marital groups. At the level of the individual 
trait this question can be raised on the basis 
of 34 comparisons. In 32 instances the asso- 
ciation is greater in the case of the premari- 
tals. Two tests can be made of the question 
using the product-moment coefficients. The 
difference between wives is significant at the 
.o2 level, and the difference between hus- 
bands is significant at the .or level. 

4. Similarity in self-ratings. Although 16 of 
the 17 comparisons, using the individual 
traits, indicate larger correlation between self- 
ratings in the case of the premarital group, 
the difference between the coefficients as well 
as their absolute values are very small. 
Neither of the differences between medians 
or between coefficients of correlation based 
upon the personality-attractiveness scores is 
statistically significant. The direction of such 
differences, however, is consistent with what 
is observed in the coefficients based on the 
individual traits. 

The two groups under investigation differ 
materially in respect to marital history, age, 
duration of marriage, answers to questions as 
to various aspects of their marriages, answers 
to questions as to their parents’ marriages, 
their own teen-age happiness, and their per- 
sonality ratings, both with respect to them- 
selves and to their spouses. In lesser degree 
they differ in respect to education. In the 
light of this fact it is necessary to determine 
whether the tendencies in the distributions of 
coefficients of correlation are attributable to 
the difference in the atmosphere of the mar- 
riage suggested by its history and the an- 
swers to questions dealing with the atmos- 
phere, as given by the partners, or to such 
matters as age of partners, duration of the 
marriage, and, most important, the general 
level of attractiveness with which the re- 
spondents reported their own and _ their 
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spouses’ personalities. To settle these ques- 
tions, coefficients of correlation were calcu- 
lated separately for the postmarital group in 
order to discover any tendencies for the corre- 
lations between ratings to be dependent upon 
these other respects in which the groups dif- 
fered. The postmarital group was chosen 
for this analysis because of the substantially 
larger differences existing within it, a fact in- 
dicated by the larger standard deviations re- 
ported in the analysis of the variables under 
consideration. 

To reach this objective each respondent's 
schedule was analyzed in order to discover 
the extent of agreement in his ratings of him- 
self and his partner. The extent of agree- 
ment was calculated by counting the num- 
ber of traits in which the respondent rated 
himself and his partner identically. This 
value was then correlated with each of the 
variables in respect of which the premarital 
and postmarital groups were known to dif- 
fer. Where the obtained coefficient was high, 
it was evident that the tendency to rate self 
and partner similarly was dependent upon 
the variable under investigation. Hence, such 
a variable could be used as a basis for explain- 
ing the differences in the correlations obtained 
from the pre- and postmarital groups. Where 
the correlation was low it was evident that 
the variable in question could be eliminated 
as a factor accounting for the difference in 
the correlations on the trait-ratings obtained 
from the two groups. Table 4 summarizes 
the results of this work. 

Table 4 indicates clearly that no great de- 
gree of association exists between the vari- 
ables differentiating the two groups and the 
tendency to rate self and spouse similarly, 
except in instances which are related to the 
atmosphere of the marriage. The coefficients 
of correlation (excepting certain coefficients 
related to the atmosphere) are generally in 
the range under .30 and no great consistency 
is observed in the signs. The .or level of con- 
fidence in the postmarital sample is of the 
order of r at .30; from this fact and the lack 
of consistency, it is evident that the only de- 
pendable coefficients are those which have 
to do with methods of resolving disagreement 
and the extent of marital happiness. In the 
former case the coefficient signifies that those 


TABLE 4 


CoEFFICIENTS OF CORRELATION BETWEEN Per CENT 
or Exact AGREEMENT IN RaTINGs OF SELF AND 
PARTNER AND VARIABLIs IN RESPECT OF 
WHICH THE EXPERIMENTAL GROUPS 
Durrer 








PosTMARITAL 


Husband Wife 


VARIABLE 





08 
17 
.06 
-33 
19 
-73 
40 
.28 
.09 
.08 
.07 
10 
20 
28 


Duration of marriage 

Age of spouse 

How long unhappy 

Extent of confiding 

Satisfaction with affection 

Methods of resolving disagreement 
Own married happiness 

Parental married happiness 
Teen-age happiness 

Education 

Religious activity 

Children in participants family 
Own personality-attractiveness score 
Spouse’s personality-attractiveness score 


27 
+25 
15 
.18 
-10 
61 
44 
.29 
-06 
.08 
.05 
07 
20 
-§2 


LI ++t+ I++ 14411 
L 1) dtt++++44 | 





who reconcile differences by mutual give and 
take tend to report themselves and their part- 
ner alike. These latter coefficients, it will be 
noted, are of a somewhat larger order of 
magnitude than the coefficients in Table 3 
(postmarital group) which describe the ex- 
tent of the tendency to rate self and partner 
alike. 


Discussion 


In both classes of marriage under investiga- 
tion, the spouses show a strong tendency to 
report their own and their partner’s personali- 
ties in similar terms. In both classes of mar- 
riage the resemblance in the self-ratings of the 
two partners is substantially less than the re- 
semblance in the ratings of self and partner. 
Finally, the tendency to report self and part- 
ner in similar terms is stronger in the 
marriage in which no evidence of poor adjust- 
ment exists than in the marriages concern- 
ing which we have evidence of unhappiness. 
Such results invite discussion from both 
theoretical and practical viewpoints. Evi- 
dently they are relevant to the understand- 
ing of the ways by which we form impres- 
sions of the personality of ourselves and of 
others. In addition, the size of the coefficients 
of correlation suggests novel uses to be made 
of rating forms in diagnosing maladjustment 
and in conducting counseling. 

From the theoretical viewpoint, we have to 
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inquire as to how the circumstances of inter- 
personal relationships in marriage function 
to determine and control the correlation be- 
tween impressions which are formed of the 
personality of oneself and that of the other 
person defining the relationship. Put in such 
words, the question is of very general interest 
since it implies larger domains of social rela- 
tionships. 

No direct answer to the question of how 
the atmosphere of a marriage determines and 
controls impressions of personality can be 
given from the data of this experiment. Not- 
withstanding, the data are very clear-cut as 
to the direction to be taken in establishing 
likely hypotheses. They indicate clearly, for 
example, that the effects are not associated 
with the age of the respondents, the lengths 
of the marriages, the degree of education of 
the respondents, or other factors which were 
submitted to direct examination. This means 
that no hypothesis which supposes that the 
longer two people know each other the more 
likely it is that each will perceive himself and 
his partner in a different way, will account 
for the facts. It means that no hypothesis can 
be entertained which supposes that the ca- 
pacity to discriminate between oneself and 
another person is dependent upon any factors 
believed to be associated with increasing age. 
Without doubt an acceptable hypothesis must 
in some way be associated with the central 
distinction between the groups—one group 
has a history of marital unhappiness, while 
the other group has a quite contrasting his- 
tory. 

The hypothesis which we offer supposes 
that impressions of personality are subject to 
different forces depending upon whether the 
persons involved are on the same side or on 
different sides of a conflict. As will be seen 
as the hypothesis is developed, the presence 
of a conflict exerts important influences upon 
discriminatory processes. Discrimination be- 
tween the self and the personalities of per- 
sons on the other side, in the case of those 
who are divided by a conflict, is increased. 
On the other hand, discrimination between 
self and those on the same side is decreased. 
The hypothesis, if accepted, accounts for the 
difference between the pre- and the postmarital 
groups because those groups are known to 
differ in the extent to which members have 
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been on the “same” as against “different” 
sides." 

Why does the experience of being on the 
“same” or “different” sides affect the forma- 
tion of impressions of personality? ‘The ques- 
tion is not difficult, but a complete answer 
requires the use of an additional concept. 
First, people on the same side behave in the 
same way to changes in the situation which 
divided the group into sides, while people on 
different sides behave in quite different ways 
to the same changes in the external situation. 
People on the same side have their hopes 
similarly aligned to changes in the situation. 
With people on opposite sides, on the other 
hand, what excites hope in the one excites 
fear and despair in the other. All of this 
sums up in the statement that the presence 
of a conflict predisposes to the production 
of uniform behavior over those on the same 
side and to the appearance of diverse be- 
haviors in those who are divided by the 
conflict. 

The mention of the hopes and fears which 
are associated with many conflicts makes it 
convenient to introduce the additional con- 
cept mentioned above. Level of aspiration 
theory has been fortunate in receiving a clear 
definition of the terms realism-irrealism at 
the hands of Irwin (4). According to Irwin, 
a realistic aspiration (or an expectation for 
that matter) is one which is based upon ex- 
ternally conditioned fact perceived to be rele- 
vant to the situation by the person enter- 
taining the aspiration, while an_ irrealistic 
aspiration is one which is conditioned by in- 
ternal factors, of which hopes and fears offer 
a good example. The data of the present 
investigation show conclusively that neither 
group was behaving realistically, since the 
pattern of the observed correlations is incon- 
sistent with such an hypothesis. Such a con- 


7 We should like to point out that an hypothesis of 
this kind is very ingratiating because it runs parallel 
with certain aspects of other observations of behavior in 
group situations. Allies in international conflict com- 
monly see themselves as being similar only so long as 
they are allies. It is not necessary to labor the applica- 
tion of this statement in the history of the last 10 years, 
but we may note in passing that at the level of news- 
papers and news magazines as well as at the level of 
everyday intercourse, our awareness of difference be- 
tween ideological systems and between ourselves and 
persons associated with systems other than our own has 
increased with the disappearance of common goals and 
the spirit of the ally. 
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clusion must be rejected because the correla- 
tions between the self-ratings of the spouses 
are uniformly much less than the correlations 
between the ratings of self and partner no 
matter which spouse is studied. Further, the 
data of the experiment indicate conclusively 
that the happily married group exhibit a 
larger discrepancy between the relevant cor- 
reiation coefficients. From these two facts 
the conclusion is inescapable that the happily 
married groups show more evidence of lack 
of realism in their personality appraisals than 
the unhappily married group. To the extent 
that we are justified in generalizing this con- 
clusion it appears that in the marriage rela- 
tionship the judgments of people on the same 
side as to their own and their partner’s per- 
sonalities are apt to be less realistic than those 
judgments of people on opposite sides which 
deal with the personalities of persons on both 
sides of the controversy. The lack of realism 
characteristic of persons on the same side 
implies the presence of affective states, and 
we do not believe that it is any great assump- 
tion on our part to assume that happily mar- 
ried people wish to be alike in many if not all 
of those respects which we have submitted to 
study. Such a condition is suggested not only 
by everyday experience but also by the theory 
of projection.® 

There is a final respect in which people 
on the same side may be thought to differ 
from people on opposite sides. People on the 
same side may make the same allowances 
for themselves and their allies, while people 
on different sides may not make the same 
allowances for themselves and their oppo- 
nents (if they make any for the latter). This 
latter statement sums up in the assertion that 
people on the same side are using the same 
frame of reference or the same standards in 
appraising the reactions of themselves and 
their allies, while people on one side will use 


8 The present data offer no opportunity to test the 
hypothesis, but it is by no means impossible that very 
severe conflict between persons or between groups might 
result in irrealistic appraisal of the personality which is 
just the opposite of the kind noted here. Such an 
irrealistic appraisal would be observed if the correlations 
between self and the other person uniformly bore 
negative signs, with the correlations between the self- 
ratings in the neighborhood of zero. In such an event 
each of the two partners would be describing himself 
as wholly different from the other, at the same time 
giving up his ratings in such a way as to produce low 
correlation between the self-ratings. 
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different standards or different frames of ref- 
erence in dealing with the same behavior in 
themselves and in those who are opposed to 
them. 

The forces in a situation produce different re- 
actions to the same events in people on oppo- 
site sides of a question; such people are apt to 
be somewhat more realistic in evaluating dif- 
ferences between their own personalities and 
the personalities of those who are opposed to 
them. In addition, the two reactions are eval- 
uated in accordance with different sets of 
standards. Finally, in the marriage situation 
there is the impetus to identification provided 
by the force of love itself. All of this leads to 
greater similarity of report of personalities 
when we examine the statements of people 
on the same side. Put in another way, all 
of this produces invalidity in the ratings as 
measures of the personalities, considering the 
personalities objectively. 

Per cent agreement in personality ratings 
possesses potential diagnostic significance. 
Such scores also offer a potential source of 
direction to the counselor. Before this po- 
tential source becomes actual, however, fur- 
ther research is required. In particular, we 
require careful investigation of the therapeu- 
tic consequences of revealing the lack of 
realism which is so characteristic of the per- 
sonality assessments of persons who are pres- 
ently happily related. If information can be 
revealed without causing unhappy conse- 
quences and if its significance is compre- 
hended, it can result only in an increasing 
realism on the part of persons entering mar- 
riage—a consequence which should make a 
certain contribution to the promotion of mar- 
ital adjustment and the reduction of marital 
disharmony. 

The systematic variations in the intercorre- 
lations of the traits as revealed in Table 3 
provide a serious basis for doubting the valid- 
ity of such ratings as indicators of the traits 
which they purport to measure. For this 
reason it is evident that conclusions having to 
do with homogamy which are based upon 
ratings made by marriage partners of each 
other are of little value. 


SUMMARY AND CONCLUSIONS 
1. Husbands and wives consistently rate 
themselves and their partners similarly on 
specific personality traits. 
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2. While all spouses show such a tend- 
ency, happily married partners exhibit mate- 
rially higher correlation on their ratings of 
themselves and their partners than do un- 
happily married partners. 

3. The self-ratings of husbands and wives 
reveal negligible correlation. 

4. The foregoing relationships are sensibly 
independent of age, duration of marriage, 
education of the respondents, number of chil- 
dren in the respondents’ families, the marital 
happiness of the parents of the respondents, 
as reported by the respondents, and of other 
variables in respect of which the two groups 
of respondents differed in varying degrees. 

5. The foregoing relationships are mark- 
edly dependent upon differences between the 
two groups which are direct reflections of the 
quality of their marital adjustment. 

6. Under the assessment conditions obtain- 
ing in this investigation, happily married per- 
sons show more evidence of lack of realism 
in personality appraisals of themselves and 
their partners than unhappily married per- 
sons. 

7. It is suggested that the foregoing results 
are a direct consequence of the fact that 


people on opposite sides of a conflict situation 
have more opportunities to take note of their 
opponent as different rather than similar to 
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themselves, whereas persons with strong af- 
fective feelings (such as love) promoting a 
wish for identification, tend to see their part- 
ners as similar rather than dissimilar to them- 
selves. 

8. Practical applications of the instruments 
developed for this research are suggested at 
the levels of diagnosis and counseling of 
maritally maladjusted people. 

g. The ratings studied in the present in- 
vestigation, while probably valid as indicators 
of the feelings of the individual making 
them, lack validity as indicators of the traits 
which they purport to measure. 
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THE EFFECT OF INSTITUTIONALIZATION ON ATTITUDES 
TOWARD OLD PEOPLE’ 
JACOB TUCKMAN anv IRVING LORGE 2 


Teachers College, Columbia University 


N a previous study (1) the authors investi- 
gated the attitudes of graduate students 
toward old age as measured by their re- 

sponses to a questionnaire of 137 statements 
about old people. These statements were 
classified into 13 categories: Physical, Finan- 
cial, Conservatism, Family, Attitude Toward 
the Future, Insecurity, Mental Deteriora- 
tion, Activities and Interests, Personality 
Traits, Best Time of Life, Sex, Cleanliness, 
and Interference. With the exception of 
statements dealing with physical change, for 
which there exists some experimental evi- 
dence, most of the statements reflected popu- 
lar misconceptions and stereotypes about old 
people. The findings indicated that these 


graduate students subscribed substantially to 
the misconceptions and stereotypes about old 
people despite the fact that they constituted 
a group of sophisticated young adults with 
a good grounding in psychology who showed 


sufficient interest in the problems of aging 
to register for a course on the psychology of 
the adult. The responses to the individual 
statements revealed no age or sex differences. 


METHOD 


The purpose of this study is to compare the re- 
sponses of three groups of older age subjects (Ss) 
to the same questionnaire: a group living in the 
community, a group living in the traditional type of 
institution for the aged, and an intermediate group 
living in an apartment house which serves to bridge 
the gap between community and institutional liv- 
ing. The comparison of these groups involves the 
hypothesis that as individuals become less able to 
cope with the problems of day-to-day community 
living they will subscribe more to the misconcep- 
tions and stereotypes about old people. 


1 Retirement and Adjustment Series: Number 4. Spon- 
sored cooperatively by The Institute of Adult Education 
and the Institute of Psychological Research, Teachers, 
College, Columbia University. 

2 The writers are indebted to Dr. Frederic D. Zeman, 
Chief of Medical Service, and Miss Marie Galpern, 
Director, Social Service, The Home for Aged and Infirm 
Hebrews of New York; and to Miss Gertrude Landau, 
Head Worker, William Hodson Community Center, for 
their cooperation in this study. 
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Subjects 


The community group consisted of 21 Ss (11 men 
and 10 women). Seventy-five per cent of the group 
were single or widowed and living in furnished 
rooms. Eighty-five per cent were receiving some 
type of public assistance. During the day these Ss 
attended the William Hodson Community Center, 
a center where older men and women can get to- 
gether for recreational, social, and educational activi- 
ties. The Ss ranged in age from 60 to 80 years with 
a mean age of 71.8 years. None of the group was 
in the labor market. The occupations in which they 
had engaged prior to retirement were primarily 
in unskilled, semiskilled and sales jobs, and in small 
businesses. The educational background ranged 
from no schooling to college graduation, with a 
median of 8 years of schooling. 

The apartment house group consisted of 20 Ss (5 
men and 15 women) living in an apartment house 
under the jurisdiction of The Home for Aged and 
Infirm Hebrews of New York. Residents of this 
project had to meet the same eligibility require- 
ments as residents of the traditional type of institu- 
tion. One of the most important eligibility require- 
ments was the inability, for one reason or another, 
of the individual to function adequately and inde- 
pendently in the community. However, these Ss 
were permitted to live in the apartment house 
rather than in the more formal institution because 
they were considered by the institutional authori- 
ties to be healthier both physically and psycholog- 
ically than residents of the traditional type of in- 
stitution. The apartment house does not carry the 
stigma of an old age home and there is little or no 
restriction on the activities of the residents. The Ss 
ranged in age from 65 to 84 years, with a mean 
age of 74.3 years. The educational and socioeco- 
nomic status of these Ss was much superior to the 
other two groups in this study. The educational 
background of the group ranged from no schooling 
to college graduation with a median of 12 years 
of schooling. The occupations pursued previous to 
retirement were in the professional, semiprofes- 
sional, and high level clerical occupations. 

The institutional group consisted of 48 Ss (29 
men and 19 women) living in two different build- 
ings under the auspices of The Home for Aged and 
Infirm Hebrews. In one building there were 21 Ss 
whose average stay at the institution was 3 months. 
In the other building there were 27 Ss whose aver- 
age stay at the institution was 5% years. The age 
range, previous education, and previous occupation 
were similar in both groups. The combined group 
ranged in age from 61 to 88 years, with a mean 
age of 76.3 years. Previous education ranged from 
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no schooling to college graduation, with a median 
of 9 years of schooling. The occupations in which 
they had been engaged prior to retirement were in 
small businesses, and in sales, skilled, and semi- 
skilled occupations. 

None of the groups in this study represents an 
adequate cross section of older age individuals liv- 
ing in the community or in institutions. All were 
volunteers who were required to have a sufficient 
knowledge of English to fill out a written question- 
naire. Individuals in very poor health or those not 
possessing adequate hearing and vision were ex- 
cluded. In view of the fact that the same criteria 
were employed in selecting each of the groups, it 
seems reasonable to compare their responses on this 
questionnaire. The age distribution was similar in 
all groups. The community and institutional groups 
were fairly well equated for previous education and 
socioeconomic status, but the apartment house group 
was superior to the other groups in these respects. 


Procedure 


The questionnaire was administered to Ss in 
group sessions. The average time required to com- 
plete the questionnaire was about 30 minutes, or 
twice as long as that required by younger Ss with 
more schooling. There was a wide variation, rang- 
ing from 15 minutes to 1 hour and 10 minutes. 
During testing sessions of all the groups Ss did 
not hesitate to ask questions or to express verbally 
their reactions to individual statements. 


REsuLts AND Discussion 


The mean number of yes responses and 
standard deviation for the community and 
apartment house groups and the two sub- 
groups comprising the institutional popula- 
tion are presented in Table 1. 


TABLE |! 


Mean Numpser or Yes Responses AND STANDARD 
DeviaTION FoR ComMuNITY, APARTMENT Howse 
AND INSTITUTIONAL Groups ON OLp PEopLEe 
QUESTIONNAIRE 


MEAN SD 


Group NuMBER 





Community 

Apartment House 

Combined Institution 
Institution (3 months) 
Institution (5% years) 

Apartment House and 
Combined Institution 


61.7 24. 
66.9 19. 
81.8 20. 
80.9 


NNN WH 
in & 


82.4 


— 
2 


77-4 





The data in Table 1 show that agreement 
with the misconceptions and _ stereotypes 
about old people tends to increase from com- 
munity to apartment house to short-term in- 
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stitution to long-term institution. The dif- 
ferences between community and apartment 
house and between short-term and long-term 
institutional groups are not significant. How- 
ever, in comparing community with each of 
the institutional subgroups or with the com- 
bined apartment house and institutional pop- 
ulations, the differences are significant at the 
or level of confidence. In comparing apart- 
ment house and institution, the difference 
between apartment house and short-term in- 
stitution is significant at the .05 level; the 
difference between apartment house and 
long-term institution and the difference be- 
tween apartment house and total institution 
is significant at the .or level. 

An item analysis of the 137 statements in 
the questionnaire indicates that there are 73 
statements which show a significant differ- 
ence between one or more groups. There 
are 20 statements where the proportion of 
yes responses is significantly lower for com- 
munity than for apartment house; 52 lower 
for community than for institution, and 34 
lower for apartment house than for insti- 
tution. The 73 significant statements classi- 
fied into 13 categories (Physical, Financial, 
Conservatism, Family, Attitude Toward the 
Future, Insecurity, Mental Deterioration, 
Activities and Interests, Personality Traits, 
Best Time of Life, Sex, Cleanliness, and In- 
terference) are given in Table 2. The pro- 
portions of the three populations in agree- 
ment with these significant statements and 
with the total number of statements within 
each of the 13 categories are also given. 

The data in Table 2 indicate that the Finan- 
cial, Insecurity, Physical, Mental Deteriora- 
tion, Personality Traits, Interference, and 
Conservatism categories differentiate the 
three populations while the Activities and In- 
terests, Sex, Best Time of Life, Cleanliness, 
Family, and Attitude Toward Future cate- 
gories do not. For the former, the percen- 
tage of yes responses increases progressively 
from community to apartment house to in- 
stitution. This trend is evident not only for 
statements showing a significant difference 
between one or more groups but also for all 
statements within a particular category, al- 
though the trend is more evident when only 
significant items are considered. 

The tendency for categories covering physi- 
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cal decline, insecurity, personality character- 
istics, mental deterioration, financial matters, 
interference, and conservatism to differentiate 
the groups more than statements in the other 
categories is not surprising since these areas 
are of greater importance in determining 
the ability of the individual to function ade- 
quately in the community. 

It appeared from observations in the test- 
ing situation that there was a tendency for 
Ss to project their own experience in respond- 
ing to statements in the questionnaire. For 
example, a woman read aloud the statement, 
“Old people have no interest in the opposite 
sex,” and then said, “I should say not. I am 
still interested in men.” To the statement, 
“Old people are hard of hearing,” a man 
with a hearing aid gave a yes response. 
There were also some Ss in each group who 
refused to answer some of the statements 
dealing with family relationships on the 
ground that it would be impossible for them 
to answer these questions because they had 
never been married or because they had no 
children. While they admitted that they had 
had an opportunity to observe relationships 
between parents and children, they refused to 
answer these questions because they did not 
know what they themselves would have done 
in these situations. There were also Ss 
who wrote in explanatory remarks on the 
questionnaire itself, which indicated the tend- 
ency to project their own experience. 

More objective evidence on this point is 
available through a comparison of several 
statements in the questionnaire with related 
questions in a medical questionnaire. Each 
of the groups was given the Cornell Medi- 
cal Index, a questionnaire covering physical 
and psychosomatic complaints, supplemented 
by a shorter questionnaire designed to cover 
questions in the same areas applicable spe- 
cifically to older Ss. There were a number 
of statements common to both the Attitudes 
Toward Old People questionnaire and the 
Cornell Medical Index and Supplement. For 
example, on the Attitudes questionnaire the 
statement, “Old people need glasses to read,” 
was compared with the question on the med- 
ical questionnaire, “Do you need glasses to 
read?” There were 72 Ss for whom both 
test records were available. The comparison 
of statements in the Attitudes questionnaire 


with questions in the Cornell Medical Index 
and Supplement, showing the proportion of 
joint agreement, is presented in Table 3. 

It is evident from the data in Table 3 that 
Ss tend to project their own experience when 
responding to statements about old people. 
Joint agreement ranged from 41 per cent to 
87 per cent. The percentage of joint agree- 
ment of yes responses is very high for state- 
ments pertaining to the need for glasses and 
loss of teeth. For statements dealing with 
helplessness and fear of the dark the per- 
centage of joint agreement is very low for yes 
responses but very high for no responses. It 
would appear that statements which are not 
threatening, such as the need for glasses and 
the loss of teeth, have a high percentage of 
joint agreement of yes responses while state- 
ments which are quite threatening, such as 
helplessness and fear of the dark, show a 
very small percentage of joint agreement of 
yes responses. 

The data support the hypothesis that as in- 
dividuals become less able to function inde- 
pendently in the community, they subscribe 
to a greater degree to the misconceptions and 
stereotypes about old age. The apartment 
house group subscribes more to these mis- 
conceptions than the community group; the 
institutional population subscribes more than 
the apartment house group. Differences be- 
tween these groups in subscribing to the 
stereotypes about old people stem not only 
from the acceptance of the cultural expec- 
tations about the role and activities of old 
people and the individual’s observations of old 
people around him, but also from the realiza- 
tion and acceptance of these statements as 
applied to themselves. The data suggest that 
the individual who agrees that old people are 
better off in old age homes, are hard to get 
along with, like to gossip, are grouchy, repeat 
themselves in conversation, usually live with 
their children, statements which most sharply 
differentiate (at the .oor level) the commu- 
nity from the combined apartment house and 
institutional groups, does so because this ap- 
pears to him to be characteristic of his asso- 
ciates and also because this is characteristic 
of himself. 
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AN EXPERIMENTAL EVALUATION OF GROUP PSYCHOTHERAPY * 
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Veterans 


HE purpose of the present study was to 

measure the effect of a relatively short- 

term group psychotherapy by means of 
psychological tests and ratings of ward be- 
havior. Measurements were taken at the be- 
ginning and at the end of a period of from 
three and a half to four months of therapy. 
This was done with two groups, namely, a 
group of colored schizophrenic patients and a 
group of white, postlobotomy patients. Peters 
was the therapist in the first, Jones was thera- 
pist in the second group. Each of the authors 
served as assistant, or auxiliary ego, in the 
other’s group. The psychological tests were 
administered to the members of both groups 
by Jones. During the time of this study, from 
February to December, 1950, all the patients 
involved were on closed wards of the Vet- 
erans Administration Hospital, North Little 
Rock, Arkansas. Group therapy, like all 


other forms of treatment in this hospital, is 
done under close medical supervision. 


PROCEDURE 


Both groups of patients met once a week for an 
hour and a half. The form of therapy used was 
essentially the same in the two groups. The therapist 
maintained an atmosphere of tolerance and _ per- 
missiveness, encouraging as much spontaneity from 
the patients as possible. Part of each period was 
usually spent in open discussion of any topic brought 
up by a patient. Most sessions, however, pivoted 
around one or more psychodramas which were 
developed as far as possible within the group. No 
rigid pattern was followed. At all times the thera. 
pist attempted to get the patients to create their 
own dramas during the period of the meeting. 
Occasionally the therapist would come to a meeting 
with a particular psychodrama planned and pointed 
at one patient. The drama situation would be one 
revolving around an interpersonal relation with 
which the patient was apparently having difficulty. 
The therapist got his material for these planned 
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dramas from the patient’s ward folders, from ma- 
terial brought out in the group meetings, from 
individual interviews with the patient, and from 
information furnished by ward personnel, social 
workers, and ward physicians. The various drama 
techniques described by Moreno (2, pp. 1-18), such 
as, substitute role, reversal of roles, mirror tech- 
nique, symbolic distance, were used. Volunteer 
workers, aides, nurses, and psychological interns 
were integral parts of these groups and took part in 
all activity, including role playing. 

The therapist intentionally avoided anything in 
the way of symbolic interpretation or depth therapy. 
After each drama the patients were encouraged 
to discuss anything which came to their minds, 
This was usually a thrashing out of the obvious 
social features of the drama. The patients gave 
one another advice; the therapist never gave advice. 
The only time that the therapist assumed dominance 
was wher he was starting a psychodrama, and then 
he permitted the patient involved to structure all 
details. As far as possible the action was confined 
to portrayal and solution of interpersonal relations 
within the group, on the ward, or in the hospital 
community. It is the belief of the authors that, 
although intimate family relationships may be 
the primary causes of the patient's difficulty, he can 
get insight into these and transfer to them the solu- 
tions worked out to everyday interpersonal rela- 
tionships which lie at the periphery of his social 
atom. After all it is his failure to reach a solution 
to these, and not the timeless relations to his 
mother and father, which is the present, immediate 
source of trouble. 

The psychological test battery, which was ad- 
ministered individually before and after the therapy 
period, consisted of the Porteus Mazes, the Mirror- 
Tracing test, the Rorschach test, and the Draw-a- 
Man test. The choice of these tests was based on 
a number of considerations. One factor was the 
characteristics of the patients, especially in the 
colored schizophrenic group. These patients were 
not highly literate, although most of them could 
read. Their cultural background and the nature 
of their mental disorder severely limit the kind and 
number of tests on which uniform cooperation can 
be expected. A second factor considered was time. 
It was desirable that the whole test battery be 
one which could be administered in a single inter- 
view. The average stay in the hospital of these 
patients is close to four months, and thus therapy 
had to begin shortly after their admission if it 
was to be of the desired duration. A long battery, 
one which would run over several interviews, would 
delay the beginning of therapy, and consequently 
shorten its duration. It was also desirable to have 
a short testing period at the end of therapy, since 
the patient's release often came about with little 
warning to the psychologist and under general 
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conditions of urgency. A third consideration was 
the nature of the tests themselves and what they 
ostensibly measure. The Porteus Mazes and Mirror 
Tracing were chosen as instruments which ap- 
proach the stability of a patient from the side of 
overt adjustment responses to external problem 
situations. The Rorschach and the Draw-a-Man 
take an opposite approach, from the verbal-ideational 
side. These tests are to some degree complements 
of one another. With the performance tests the 
individual is required to make an overt adjustment 
within the limits of certain rules of conduct; with 
the projection tests no rules are laid down, the 
problems are not primarily ones of adjustment, but 
rather of daydreaming or giving expression to 
fantasy life. 

The Gardner Behavior Chart (6) was filled out 
twice by ward personnel on each patient at the 
same times that his psychological tests were given. 
This set of rating scales gets at the patient’s day- to- 
day adjustment to the social environment of the 
hospital. With the addition of this measurement 
to the others the authors believe that three broad 
facets of the patient’s behavior were assessed: (a) 
overt adjustment to reality problems, (4) fantasy 
productions, and (c) social adjustment. 

The two groups which were used in this study 
were in many respects quite different. The per- 
sonnel of the colored schizophrenic group was con- 
tinually changing, the majority of patients remain- 
ing only three or four months. On the other hand, 
the lobotomy group was made up of eight patients 
who had been hospitalized for a number of years 
and had all been living together for some time on 
the same ward. The personnel of this group re- 
mained constant during the period of the investi- 
gation. The colored group had been in operation 
for a year when the present investigation was 
undertaken; therapy with the lobotomy group was 
started at the time this study began. 

The procedure used in this study was designed 
primarily for the colored group. These patients 
were all housed in one building, were attended by 
the same ward personnel, and remained in the 
hospital a period of time which was deemed ade- 
quate for investigation of group psychotherapy. 
Within his first week in the hospital, every colored 
patient who was provisionally diagnosed schizo- 
phrenic was examined with the battery described 
above, and if he was not mentally deficient was used 
in the experiment. Those who were accepted 
were allocated alternately to a Test and a Control 
The Test patients were seen once a week 
in group psychotherapy, otherwise the hospital 
treatment for both groups was the same. The 
great majority of patients in both groups received 
a course of electric shock treatments. During a 
patient’s first week in the hospital four members 
of the ward personnel, one nurse and three aides, 
filled out a copy of the Gardner Behavior Chart 
As soon as they were completed the four copies of 
the chart were returned to the psychology depart- 
ment. It was the original intention to give each 
patient his second psychological examination, and 
to have each rated on the Gardner Behavior Chart, 
after a four-month period, but since some patients 
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were released for various reasons before the antici- 
pated interval had elapsed, they were either dropped 
from the study or given their second examination 
before the four-month point. If such a patient 
had been in therapy three and a half months, he 
was examined and included in the study; if his 
period of therapy was shorter than three and a 
half months, he was dropped. At the end of 
the study, after the investigation had lasted almost 
a year, twelve patients had completed the cycle in 
each group. 

The lobotomy patients were put through the same 
cycle, of four months of group psychotherapy with 
psychological tests before and after, as the colored 
Test Group. However, no control group was used 
with these patients. The investigation of this 
group was not considered a vital part of the design, 
but was undertaken only for the significance the 
results might add to the major part of the study. If 
the test changes, from first to second examination, 
in the lobotomy group are found to be essentially 
the same as those isolated by the test-control pro- 
cedure with the colored patients, it would lend 
validity to the conclusions growing out of the 
major part of the investigation. 


RESULTS 


The Colored Schizophrenic Group 


The findings on the Porteus Mazes and the 
Mirror-Tracing test, for 10 experimental and 
11 control patients, were presented in a previ- 
ous report by the same authors (4). The addi- 
tion of the records for two experimental 
patients and one control patient, who finished 
the procedure cycle since the preceding pub- 
lication, did not materially change the mean 
scores and the comparisons reported. How- 
ever, a different and more adequate statistical 
tool for treating the results has been used. 
A basic weakness in the design of this in- 
vestigation is that, owing to obvious adminis- 
tration difficulties, the patients could not be 
matched prior to allocation to Test or Con- 
trol Groups. If this had been done, match- 
ing would have been made on the basis of 
the Gardner Behavior Chart, and it is not 
likely that in such a case the correspondence 
of the two groups on other measurements 
would have been much closer. In other 
words, there would have remained the prob- 
lem of comparing two groups in final scores 

“when their initial scores were different. It 
was decided to treat the results by the method 
of simple analysis of covariance, which ad- 
justs the variance at the second testing rela- 
tive to the variance at the first testing (1, pp. 
180-196). This treatment takes account of 
the correlation between scores on the first 
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and second testing, and actually amounts to 
comparing the observed scores on the second 
testing with those to be expected from the 
regression coefficient, assuming that both 
groups are in reality drawn from the same 
population (the null hypothesis). 

Porteus Mazes. The reader may refer to 
the previous publication (4) for a compari- 
son of the means of the two groups on 
quantitative maze scores before and after the 
therapy. These comparisons are not essen- 
tially changed by the addition of three more 
patients’ records. Mean scores for the Test 
Group are now, for first and second testing, 
11.1 and 13.3 (Test Age); and for the Control 
Group the corresponding means are 9.1 and 
11.2. The difference between the Test Group 
means is statistically significant; the other 
comparisons are not significant. An analysis 
of covariance yields an F-ratio (of between 
groups to within groups variance) of only 
1.1758, which of course is not significant. 
Thus, the null hypothesis cannot be rejected 
on the basis of these data alone. 

The mean qualitative maze scores for Test 
and Control Groups, before and after the 
treatment period, did not change appreci- 


ably from those reported previously when 
the scores of the additional patients are in- 


cluded. The size of this qualitative score 
indicates inversely the degree of the patient’s 
regard for neatness and rules in carrying out 
his solution responses to the mazes. ' Features 
of his record such as crossing lines, cutting 
corners, lifting pencil, making a wavy line 
are counted for all mazes used; the score 
in the present study was obtained by divid- 
ing the total qualitative errors by the number 
of mazes used. The mean qualitative scores 
for 12 test patients and 11 control patients 
(one control patient’s record was not used 
because he failed to cooperate at the first 
testing) are, before and after the treatment 
interval, 3.95 and 1.91 for the Test Group; 
5.58 and 4.75 for the Control Group. Since 
the two groups were not initially matched 
for these measurements, an analysis of covari- 
ance was done in order to adjust the variance 
of the final scores. The results are shown 
in Table 1. The ratio of between groups 
variance to within groups variance yields an 
F of 7.785. This indicates a level of sig- 
nificance between the five per cent and one 
per cent points. 
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TABLE 1 


Quatitative Maze Score Variances ADJUSTED BY 
ANALYsIs OF COVARIANCE 





af VARIANCE 





Between Groups 35.2488 


(Test & Control) 


Within Groups 4-5278 


(Error) 





Mirror Tracing. In the preceding report a 
table was included which showed a marked 
difference between the Test and Control 
Groups in tendency to improve in mirror 
tracing from first to second testing. Although 
both showed a high number of failures on the 
first examination, the majority of Test Group 
patients improved on the second examination, 
whereas with the Control Group, just the 
opposite occurred. The additional two test 
patients passed this test the first time and 
showed improvement on the second; the ad- 
ditional control patient failed the test on 
both occasions. Ten of the 12 test patients 
improved on their second examination; only 
4 of the 12 control patients improved. The 
fourfold table, Test—Control versus im- 
proved—not-improved, yields a chi-square of 
4.2875. With one degree of freedom this 
figure indicates a p between .05 and .2, 
which is clearly significant. 

The Gardner Behavior Chart. This chart, 
which was made up especially for filling out 
by psychiatric aides and nurses, has rating 
scales for 15 different phases of the patient's 
ward behavior, such as, attention to personal 
appearance, sleep, appetite, sociability, activity 
control, cooperation in routine. Each item 
has five rating grades, namely, none, poor, 
fair, good, and extra good. The grades for 
every item are defined on the rating sheet in 
simple language. The various grades are 
weighted from zero through four. The score 
for any one rating sheet is the total of the 
15 grades received for the various items. 

At the beginning of this investigation, the 
aides and nurses working in the building 
where the colored schizophrenics were housed 
were given instructions in the use of the 
Gardner Behavior Chart. After a patient's 
admission and his acceptance in the project, 
four copies of the chart were sent to the 
Head Nurse, who had one nurse and three 
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psychiatric aides independently fill out one 
of the charts for the patient. At the end of 
the therapy period the same procedure was 
followed. Thus for each of the 24 patients in 
the experiment there were four completed be- 
havior charts at the beginning and at the 
end of the investigation cycle. The average 
of the four ratings received was taken as the 
patient’s score on the Gardner Behavior 
Chart. 

Table 2 presents the means and sigmas for 
the ratings of both groups before and after 
therapy. Comparison of these means bears 


TABLE 2 


MEAN Scores oN GARDNER BEHAviIoR CHART 


SECOND 
EXAMINATION 


First 
EXAMINATION 


M om M Om 








2.40 44.98 1.58 


Test Group 
(N=12) 


41.33 


Control Group 35.68 35.89 2.82 


(N=11) 


3-05 


out the expectations based on the hypothesis 
that the Test Group, as a result of the group 
therapy, would improve more than the Con- 
trol Group. The differences between the first 
and second ratings of the Test Group and 
between the second ratings of the two groups 
are statistically significant; the differences be- 
tween the first and second mean ratings for 
the Control Group, and between the two first 
mean ratings are not significant. However, 
here again there is the complication due to 
an inequality of the two groups at the first 
examination. An analysis of covariance, ad- 
justing the variance of the ratings after 
therapy with respect to that before, indicates 
a statistically significant difference between 
the two groups. Table 3 presents the ad- 
justed variances for between groups and 
within groups. The ratio between the two 
variances, F, is 6.6199, which has a p be- 
tween .o5 and .or. 

Rorschach. The Rorschach test was ad- 
ministered in the usual way, with first a pres- 
entation of each card and recording of re- 
sponses; any inquiry as to determinants was 
done after the patient had reacted to all 10 
cards. There was no testing of the limits. 
As usual with schizophrenic patients, the re- 
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TABLE 3 


Score VARIANCES ON GARDNER BeEHAvior CHART 
Apjyustep sy ANALYsIs OF COVARIANCE 


df VARIANCE 





Between Groups 
(Test & Control) 


154.0279 


Within Groups 
(Error) 


sponse protocols were fairly brief. In the 
Test Group the mean number of responses 
before and after therapy were, respectively, 
8.2 and 12.6; in the Control Group the cor- 
responding means were 11.3 and 13.2. Con- 
sidering both groups together the range in 
number of responses at the first testing was 
from 2 to 24; on the second testing, from 
3 tO 33. 

A number of scoring variables were exam- 
ined in the records of both Test and Control 
Groups for evidence of reliable difference in 
change from first to second examination. 
Many scoring factors, such as movement and 
shading reactions, occurred so infrequently 
in the protocols that a plausible comparison 
was out of the question. For the eight fac- 
tors examined in detail, although the com- 
parisons seemed to favor the hypothesis of 
greater improvement in the Test Group, 
every difference was either very small, or did 
not stand the test of rigorous statistical 
analysis. The following is a list of these eight 
scoring factors, with a summary of the trend 
in results. 


1. Average reaction time of first response to a 
card. The Test Group showed an average decrease, 
whereas the Control Group showed an average 
increase in reaction time. There was a decrease in 
this measure for eight of the test patients, four of 
the control patients. 

2. Number of responses. Although in both 
groups there was a mean increase in number of 
responses from first to second examination, this 
increase was greater in the Test Group. Nine of 
the 12 Test patients increased their number of 
responses on the second examination; six of the 12 
Control patients increased. 

3. Number of content categories. Nine of the 
Test patients showed an increase in number of con- 
tent categories used at the second testing; only five 
of the control patients showed a similar increase. 

4. Sum C. This scoring factor was calculated in 
the usual way for every record. Comparison of the 
records for the Test Group, before and after, re- 
vealed an average increase; the records for the 
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Control Group, a slight average decrease. Six test 
and five control patients gave a larger Sum C on 
their second examination. 

5. Number of whole responses. There was an 
average increase in the Test Group, an average 
decrease in the Control Group. Eight of the test 
patients gave more whole responses on their second 
examination, while only four of the control patients 
increased in this respect. 

6. Number of D responses. Although both 
groups showed an average increase, it was greater 
for the Test Group. Nine of the test patients in- 
creased in number of D, while eight of the control 
showed an increase. 

7. F+ %, deviation. This score was obtained by 
taking the deviation of the patient’s F+ %, regard- 
less of sign, from an ideal of 80%. The mean 
decrease in this measure was greater for the Test 
than for the Control Group. Nine of the test 
patients decreased, seven of the control patients 
decreased from first to second examination. 

8. Number of P responses. Both groups showed 
an increase in number of P responses, with the 
increase much greater in the Test Group. Nine of 
the test patients showed this increase; six of the 
control patients showed it. 


Directions of change (increase or decrease) 
in five of these scoring factors seemed to 
the authors to have little ambiguity as indi- 
cators of improved mental condition. All 24 
patients were scored as showing the presence 


or absence of these five changes, which were 
(a) an increase in number of responses, (4) 
a decrease in average reaction time, (¢) in- 
crease in number of content categories, (d) 
increase in number of P responses, and (¢) 
a decrease in the deviation from 80% F+. 

Each of these directions of change from 
first to second examination was shown with 
greater frequency by patients in the Test 
Group than in the Control Group. Five 
patients in the Test Group showed all five 
of these “signs of improvement”; none of 
the control patients showed all five of them. 

Three judges, one of the authors (Peters) 
and two fourth-year graduate students, rated 
the 24 pairs of Rorschach protocols with re- 
spect to which member of a pair showed more 
evidence of personality stability or less evi- 
dence of active psychosis. The judges, of 
course, were not aware of which of two pro- 
tocols was obtained before and which after 
the therapy interval, although they were 
aware that the pairs of records were given by 
the same patient. The 48 Rorschach proto- 
cols were typed up without the patient’s name 
or date of administration. The pairs for each 
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patient were clipped together in half of the 
cases with the first record on top, in the other 
half on the bottom. The records for Test and 
Control Groups were shuffled in one stack. 
The judges were instructed to take the pairs 
of protocols, study both of them, make a de- 
cision as to which was the better record with 
respect to indications of stable personality, 
and to clip them back together with the one 
judged to be the better on top. 

Agreement between the judges was not as 
high as was expected. The percentages of 
agreement between pairs of judges were: 71 
per cent, 71 per cent, and 83 per cent. In 
both groups the judges’ ratings indicated that 
in the majority of cases, the second Rorschach 
protocol was the better. However, this tend- 
ency was greater with the Test Group than 
with the Control. Two or three of the judges 
rated the second protocol higher for 11 of the 
test patients, eight of the control patients. 
All three judges agreed in rating the second 
protocols of seven of the test patients as show- 
ing better personality integration, whereas all 
three agreed on the same for only five of the 
control patients. 

The Draw-a-Man Test. In the administra- 
tion of this test the patient was given a blank 
piece of paper and a pencil and simply told to 
“draw a man, standing up, facing you, with- 
out clothes.” A drawing was obtained at both 
examinations from 12 of the control patients 
and 11 of the test patients. The same three 
judges, who were used with the Rorschach 
protocols, rated these drawings in the same 
manner, judging which of a pair showed the 
greater “organization of the self-concept.” 
The percentages of agreement between 
judges, for the 23 pairs of drawings, were 78 
per cent, 74 per cent, and 78 per cent. 

With respect to the basic problem of com- 
paring degree of improvement in the two 
groups, the results from this test were nega- 
tive or even slightly in favor of the Control 
Group, although none of the comparisons 
was statistically reliable. Two or three of the 
judges rated the second drawing as better in 
six of the 11 test patients, and nine of the 12 
control patients. All three judges agreed in 
rating six of the 12 control patients as show- 
ing better integration in their second draw- 
ing, while the three were in similar agree- 
ment on five of the test patients. 
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The Lobotomy Group 


This group consisted of eight male, white 
patients who at some time during the pre- 
ceding two years had received a prefrontal 


lobotomy operation. The time interval be- 
tween the operation and beginning of this 
study was two months in the case of one pa- 
tient; this interval exceeded six months for 
the other patients. These patients had been 
in the hospital for extended periods of time, 
were old schizophrenics, and had been previ- 
ous to their operation serious ward manage- 
ment problems. Number of years in the hos- 
pital at the time this study began ranged from 
six to 27; three of the patients had been in the 
hospital longer than 23 years. Their ages 
ranged from 35 to 63. 

In almost all respects these patients were 
quite different from the colored group. They 
were extremely passive, lacking in spon- 
taneity, and showed the typical postlobotomy 
low level of initiative. Few patients in this 
group could be gotten to produce, spon- 
taneously, interpersonal material upon which 
2 psychodrama could be based. Consequently, 
progress notes of their ward behavior and 
social service records of their history were 
used. Psychodramas involving interpersonal 
relations, which were obvious in their records, 
were planned and carried out in the group 
sessions. Discussion was not discouraged, but 
very little of it occurred, especially at the 
beginning. As time went on the patients 
showed obvious improvement in their inter 
personal relations within the group, in their 
ibility to assume role playing of varying de- 
grees of complexity, and in their ability to 
take part in discussions. 

The purpose of including these lobotomy 
patients in this research was to verify the 
changes found in the colored schizophrenic 
Test Group. The group was too small to 
assess statistical reliability, and no attempt 
In the 
following presentation of results, it will be ob- 


was made to use a control group. 


served that in general the lobotomy patients 
showed the same trends and changes from 
first to second testing that were shown by the 
colored schizophrenic group. This offers con- 
siderable support to the validity of these test 
effect of the 


changes as measures of the 


therapy. 
It was the original intention to use the 
Gardner Behavior Chart with the lobotomy 
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group in the same way it was used with the 
colored patients. However, as a result of mis- 
understanding and changes in ward person- 
nel the ratings were not made out adequately 
and hence had to be discarded. 

The Porteus Maze Test. Table 4 presents 
the mean quantitative and qualitative scores 
before and after the period of group psycho- 
therapy. The number is only seven for the 
qualitative scores, because one patient refused 
to take the test on the first administration, 
and consequently his qualitative score could 
not be calculated. As in the case of the col- 
ored Test Group, the quantitative score rises 
and the qualitative score falls markedly after 
therapy. Of the 12 patients in the colored 
Test Group, nine increased their quantitative 
score on the second examination; seven of 
the colored Control Group increased theirs. 
Five of the eight lobotomy patients raised 
their quantitative score on the second ex- 
amination. 

TABLE 4 
MEAN QUANTITATIVE AND QUALITATIVE Maze Scores 


oF Losotomy Patients BEFORE AND 
Arter Group THERAPY 





Sec OND 
EXAMINATION 


First 
EXAMINATION 





Quantitative Score 
(N=8) 

Qualitative Score 
(N=>=7) 





The Mirror-Tracing Test. Table 5 pre- 
sents the pre- and posttherapy total times for 
two trials at tracing the star pattern by in- 
direct vision. Only one of the patients, the 


TABLE 5 


Totat Time 1n SeEconps AND FarLures aT Mirror 
Tracinc By INprvipuaLt Losotomy Patients 
Berore AND AFTER THERAPY 





SECOND 


PATIENT EXAMINATION EXAMINATION 





355 302 
424 Failure 
555 101 
658 489 
Failure 767 
Failure 509 
Failure Failure 
Failure 311 
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one numbered 2 in the table, did poorer on 
the second examination than he did on the 
first; another patient, number 7, failed at 
both examinations. The remaining six pa- 
tients all improved in their performance on 
the second examination. The trend of results 
shown in this table is clearly in agreement 
with the results obtained with the colored 
group which received group therapy and in 
contrast with the results of the colored con- 
trol group. 

Rorschach. This test was administered to 
the lobotomy patients in the same manner as 
with the colored schizophrenics. Their pro- 
tocols were very brief; mean number of re- 
sponses at the first examination was 8.2, at 
the second examination, 12.1. 

The records of these patients were com- 
pared on the two examinations with respect 
to the same eight scoring items investigated 
in the records of the colored patients. The 
following summary of the findings shows, as 
in the case of the colored Test Group, changes 
which can be plausibly interpreted as evi- 
dence of improvement. 


1. Average reaction time of first response to a 
card. All eight patients decreased their average 
reaction time on the second examination. 

2. Number of responses. Seven of the eight 
lobotomy patients gave more responses on their 
second examination than they did on the first. 

3. Number of content categories. Five of the 
eight patients increased the number of content cate- 
gories used; only one patient decreased in this 
respect. 

4. Sum C. 
two decreased. 

5. Number of whole responses. Seven of eight 
patients increased their number of whole responses, 
one gave the same number on both testings. 

6. Number of D responses. Four of the eight 
lobotomy patients showed an increase in number 
of D’s. Three patients decreased in number of D's 
on the second testing. 

>. F+ deviation. Six of the eight patients 
decreased in their deviation from 80°, F+. 

8. Number of P responses. Five patients in 
creased in number of P’s, only two decreased. 


Four patients increased in Sum C, 


The three judges who were used in rating 
the Draw-a-Man and Rorschach records for 
the colored schizophrenics also rated the same 
records for the lobotomy group. The differ- 
ences between paired records (first and sec- 
ond examinations) for the lobotomy group 
were apparently more marked, since the 
judges agreed among themselves 100 per cent. 
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In the case of seven of the eight patients, all 
three judges agreed on rating the second 
protocol as showing better personality inte- 
gration. They all agreed in showing that 
one person’s second record was not as good 
as his first. 

The Draw-a-Man Test. The three judges 
also agreed 100 per cent among themselves 
in their rating of lobotomy patients’ draw- 
ings. They all agreed on rating the second 
drawing of four patients as distinctly better 
than the first; and they were in agreement 
in rating the second drawing as worse in 
the case of the other four patients. 

The reader will note that, as it was in the 
case of the colored schizophrenic group, this 
test differs from the others in that it fails to 
show the change toward improvement in the 
majority of the group following therapy. 


or Group PsYCHOTHERAPY 


Picture Sorting Test. A full-face, four by five 
inch photograph was taken of each of the lobotomy 
patients. Before and after the therapy interval 
each of the patients individually sorted the pictures 
of the other seven members of the group under the 
same conditions. In administering this test, the 
patient was seated at a table with three cards 
before him, the one on the left with “Like” printed 
on it, the one on the right had “Dislike” printed 
on it, the card immediately in front of the patient 
had the word “Neither” printed on it. It was 
explained to the patient in some detail that he was 
to go over the pictures one at a time, decide 
whether he liked the individual represented, did 
not like him, or was indifferent, and to place the 
picture before the appropriate printed card. The 
pictures, of course, were shuffled for each patient. 

The purpose of this test was to determine the 
presence or absence in the members of the group 
of affective reactions toward one another. The 
hypothesis in mind was that this form of psycho- 
therapy would increase the affective reactions (like 
and dislike). It was not expected that the number 
of likes would necessarily increase. Especially with 
this type of patient the appearance of any affective 
reaction toward others is to be taken as a sign of 
improvement. Their usual condition is one of com- 
plete affective isolation toward others, with the 
possible exception of a few of the ward personnel. 

The expected shift in affective judgments among 
the patients occurred. At the first rating the num- 
ber of pictures placed in the like, dislike, and 
neither categories respectively were 22, 10, and 32. 
After the therapy the corresponding numbers were 
30, 15, and 19. Five of the patients placed fewer 
pictures in the neither category on the second sort- 
ing; two of the other patients placed the same 
number in the neither category on both sortings; 
only one patient increased in the number of pic- 
tures placed in the indifferent category. 
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SUMMARY AND Discussion 


The investigation described in this paper 
was a test of the hypothesis that group psy- 
chotherapy using action methods has measur- 
able effects on patients’ behavior. From this 
hypothesis it was deduced by the investiga- 
tors that certain changes in a patient’s per- 
formance as measured by the various iastru- 
ments used would appear following the 
therapy interval. The null hypothesis with 
respect to each of the test measures was tested 
by comparing the changes from first to sec- 
ond measurements in a Test and Control 
Group of colored schizophrenic patients. 

Statistically reliable results were obtained 
with three variables: The qualitative score on 
the Porteus Mazes, ease of performing the 
Mirror-Tracing test, and total ratings on the 
Gardner Behavior Chart. All three of these 
are fairly direct measurements of the patient’s 
overt behavior, and the score involves a mini- 
mum of interpretation on the part of psy- 
chologists. The Mirror-Tracing test is clearly 
the least ambiguous of all, since it is largely 
a matter of doing or not doing the prob- 
lem. There is little ambiguity to the quali- 
tative score since it is obviously a measure- 


ment of the adequacy and acceptability of the 
manner in which the patient threads the 


mazes. Of course the Gardner Behavior 
Chart involves interpretation on the part of 
the ward personnel who filled them out but 
when four different individuals’ ratings are 
pooled for any one patient there can be little 
doubt that the average total rating on fifteen 
items of ward behavior is an objective indi- 
cator and actually reflects the degree to which 
the patient is making a biological and social 
adjustment to the hospital environment. 

In all other measureraents except one the 
deduced difference in change of measurement 
occurred, but was not found to be statistically 
significant, and thus the null hypothesis could 
not be entirely rejected. This was true of 
the quantitative Maze score. This score in- 
volves interpretation of observed behavioral 
characteristics in that it is expressed in men- 
tal age form. Changes in Rorschach vari- 
ables which one would expect to go along 
with improved mental condition, were found 
with greater frequency in the Test group 
for the following: Reaction time, number of 
responses, number of content categories, Sum 
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C score, number of whole responses, and 
number of P responses. Although none of 
these separate measurements is reliable statis- 
tically the probability that all six of them 
should occur in the expected direction (and 
no variables found to give changes in the 
opposite direction) on the basis of chance 
variation only seems to be rather slim. 

With a group of lobotomy patients, who 
received the same form of psychotherapy for 
the same length of time, these measurements 
showed changes occurring in the same direc- 
tion as in the colored Test Group. Qualita- 
tive and quantitative scores on the Porteus 
Mazes changed in the same directions; 
Mirror-Tracing performance was distinctly 
improved; and, with the exception of the 
Sum C score, the same changes in Rorschach 
scoring variables occurred in the majority of 
these patients. It does not appear likely that 
this similarity in changes of measurements 
should occur by coincidence alone. 

One measurement was employed with the 
lobotomy group which was not used with the 
colored schizophrenics. This was the sorting 
of pictures of fellow-patients for affective 
value. This test also yielded the expected, 
though unreliable, change after therapy, 
namely, a decrease in judgment of indiffer- 
ence. 

Only one test did not yield results in accord 
with the deduction from the hypothesis that 
the therapy improved the patients. This was 
the Draw-a-Man test which did not distinctly 
show a trend one way or the other in either 
the colored schizophrenics or in the lobotomy 
patients. This test, of course, is highly sub- 
jective and its use depends heavily on psy- 
chologists’ interpretations. The better than 
chance agreement among psychologists’ in- 
terpretations of the drawings does not neces- 
sarily mean that these interpretations reflect 
anything distinctly relevant to improvement 
in the reality of the patients’ conditions. 

The most convincing evidence in this in- 
vestigation of an improvement in mental con- 
dition, even though slight, following group 
psychotherapy is the results of the Gardner 
Behavior Chart. There can be little question 
that ratings on this instrument actually do 
reflect the patient’s level of social adjustment. 
However, two of the psychological test meas- 
urements, which yielded reliable results, the 
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qualitative Maze score (5), and the Mirror- 
Tracing performance (3), have fairly well- 
established validities as measures of social 
adjustment. 

The differences in the psychological test re- 
sults, which consist primarily of differences 
in reliability of results and not of differences 
in significance or direction of change (with 
the exception of the Draw-a-Man test), have 
two possible explanations. One explanation 
is that the reliabilities of the scores vary; the 
other explanation is that the effects of the 
form of group psychotherapy used is variable 
in different phases of the patient’s behavior, 
phases which are tapped by the different tests. 
In the opinion of the authors it is likely 
that both these explanations are to some de- 
gree valid. The most reliable results were 
obtained with objective tests while the least 
reliable results were with projective tests 
which heavily weigh the subjective factors in 
both the examiner and examinee. Whether 
or not the Rorschach is a very reliable measur- 
ing instrument with all types of patients need 
not be raised. The vital question concerns 


the type of patients used in this study, schizo- 
phrenics, who give such brief protocols that 


they can hardly be expected to have a great 
deal of reliability. 

The chief characteristics of the form of 
psychotherapy used in this study are (a) 
its brevity, (6) heavy stress on overt activ- 
ity, and (c) a stress of social adjustment at 
the level of overt behavior. Therefore, the 
effect of this form of therapy should be first 
and most strongly felt at the overt behavioral 
level. It has already been pointed out that 
those tests that gave the most reliable results 
were the ones which assess the overt be- 
havioral adjustment of the patient. 

In this investigation the psychological tests 
were administered by the same psychologist 
who was present at the group psychotherapy 
meetings. This brings up a possible confu- 
sion in interpretation of results of these tests, 
since it might be maintained that the results 
are due tc either the therapy itself, or to ex- 
aminer familiarity. There is little doubt that 
in the lobotomy group this confusion has a 
definite basis, since the psychological exam- 
iner was also the therapist with this group, 
but in the colored schizophrenic group it 
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is not very likely that the role the examiner 
played in the group increased the test group 
patients’ familiarity with him to a very great 
extent over the familiarity resulting from the 
first psychological examination, which was, 
of course, present in both Test and Control 
Groups. The examiner consciously remained 
passive in this colored schizophrenic group, 
taking an active part only when called upon 
by the therapist to play a role in a psycho- 
drama. There were other nonpatients present 
in these groups, social workers, nurses, aides 
and volunteer workers who were also called 
upon to play roles in psychodramas. In other 
words, the added examiner familiarity at the 
second examination as a result of his presence 
in the Test Group could not have been very 
great. 

This confusion in interpretation is not pres- 
ent in the results of the Gardner Behavior 
Chart. These results show that the patients 
in the Test Group did improve as a result 
of therapy. Hence, the question is not did 
the patients actually improve more in the 
Test Group. Rather the question is, are the 
resultant psychological test changes due to 
this improvement or to the additional fa- 
miliarity with the examiner. Since the 
changes uniformly, with one exception, are 
those expected to follow improvement, it is 
most plausible to choose the interpretation 
that they are caused by the known improve- 
ment. Unless one makes the unlikely postu- 
late that the test effects of examiner famili- 
arity and improved mental condition are the 
same, there should be little question that the 
test results found are caused by the improved 
mental condition. 
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INTELLIGENCE SCORES AS A FUNCTION OF EXPERIMENTALLY 
ESTABLISHED SOCIAL RELATIONSHIPS BETWEEN 
CHILD AND EXAMINER 


ELINOR L. 


SACKS * 


Rad« liffe College 


OMEWHERE near the beginning of almost 
any book, article, or monograph con- 
cerned with the measurement of intelli- 

gence, there appears a rather general statement 


to the effect that it is important for the tester 
to establish good rapport. In some cases a 
few suggested techniques for paytting the sub- 
ject at his ease will follow such a statement, 
but beyond these the reader is left to draw up 
his own definitions by inference. What rap- 
port is in the intelligence-testing situation is 
not clearly defined in such books, nor are 
there described any attempts to examine sys- 
tematically its operation or its effects upon 
the results achieved. 

This study, then, constitutes a tentative, ex- 
perimental examination of the variable of 
rapport, one of whose purposes is to discover 
whether or not testing procedure, as it is 
standardized at present, fully takes this factor 
into account. 

It had been our original hypothesis that the 
nature of the relationship between examiner 
and subject affects performance on an intelli 
gence test. Upon examination of our experi- 
mental results, it became apparent that there 


was an additional variable involved: that of 


having any rel 


itionship at all, outside of the 
testing situation. We therefore introduced 
the factor of familiarity with the tester into 
our analysis. 

Implicit in this study is the assumption that 
the IQ is not absolutely constant. 
for the ratio of mental age to chronological 
age to maintain itself without change, the 
individual rate of mental growth would have 


In order 


to be uniform. That this is not the case is 
seen in the results of the many research 
studies on the problem of IQ constancy. That 
Richard I 


invaluab! 
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the IQ varies is known; that its variation is 
limited is also known. We wish, in this 
study, to investigate the influence of experi- 
mentally established social relationships on 
the intelligence quotient as measured by 
standard tests. 

An extensive search of the literature on 
changes in IQ produced only a small number 
of articles which could be grouped under the 
rather broad general heading of “social-emo- 
tional area” or “rapport.” All of these stud- 
ies are possibly relevant to the problem of 
rapport; not one of them examines the con- 
cept itself. 

There is one exception to this statement: 
an experiment reported in the Journal of 
Negro Education for 1936 (1). The author, 
H. G. Canady, attempted to vary rapport by 
having Negro and white examiners adminis- 
ter intelligence tests to both Negro and white 
children. A clear group difference resulted, 
the Negro children increasing and the white 
children decreasing when tested by the Negro 
examiner, but individual changes were scat- 
tered. 

Several writers mention familiarity with 
the examiner as a possible cause of gain in 
IQ on retest; the only empirical test of this 
factor found is a study by Marine (3). She 
had other examiners give an intelligence test 
. the 


to two groups of children. Then, “ 
familiar 


group which was to become the 
one ... [was] put into situations with the 
examiner [Marine] which would be most 
conducive to developing the desired degree 
of ease and happiness in her presence” (3, p. 
12). 

Marine reports: 

The experimental coefficients show in a 
clear-cut manner that the writer did not secure a 
difference between the familiar and unfamiliar 
groups in gains in points of IQ.... 

The first objective conclusion to be drawn, then, 
is that the degree of rapport ordinarily secured by 
the writer when testing an unknown little child 
with the Stanford-Binet elicited as high an intelli- 
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gence rating as when there was a considerable 
degree of familiarity between her and the child... . 

Subjectively, the writer felt that the experimen- 
tal periods did leave a residue of attitude with the 
experimental group that varied in different children 
but was absent entirely from the members of the 
control group (3, pp. 26-27). 

This study is the only one which is directly 
analogous to our present experiment; famili- 
arity with the tester is one of the variables 
considered by the writer. The present study 
goes beyond this to vary the nature of the 
social relationship within the group with 
which the experimenter was familiar (i.e., 
the experimental groups). 

Previous writers, with the exception of 
Marine and Canady, have not attempted to 
control rapport. Had every examiner estab- 
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Form L was used on the initial test in all cases. 
As soon as the testing was completed, the experi- 
mental period began. For ten days the tester 
worked as an assistant teacher in each experimental 
group for about an hour a day. 

In the experimental period, the condition being 
varied was the relationship between tester and 
children. In Group A, the attempt was made to 
establish a “good” relationship. This does not mean 
an indulgent attitude on the part of the tester; it 
does mean that she behaved as her “best nursery- 
teacher self,” showing interest in, and enjoyment 
of, the children, talking with them, and playing 
games with them. 

In Group B, the writer tried to establish a “poor” 
relationship. She sat off to one side a good deal of 
the time, found excuses not to give assistance W hen 
it was requested (and the requests rapidly di- 
minished in frequency), wore a dull facial ex- 
pression, and generally tried to get the children to 
dislike her without distrusting her. After the ex- 


TABLE |! 


MEANS 


Pre-EXPERIMENTAL 
Test 


Group S.D. 


139.9 
128.6 


124.7 


lished a good social relationship prior to test- 
ing, comparability would not be lost; or had 
every examiner limited the contacts with his 
subjects to the actual testing situation, the 
conditions would still be standardized. But 
our review of the literature gives no indi- 
cation that either control has been effected, 
though partial attempts were made, as we 
have seen. The object of the present study 


is to examine the variable of rapport in order 


to see if it does have any effect on intelli- 
gence-test performance. 


PROCEDURE 


The subjects in this experiment were three groups 
of ten three-year-old nursery-school children of 
similar social and cultural background. Two were 
used as experimental groups; the third was a con- 
trol. 

The test used was the 1937 Revision of the Stan- 
ford-Binet. 

All of the tests were performed by the writer, 
who was reasonably experienced in dealing with 
young children. If any testing error was present, 
it was a systematic one and would therefore not 
have any adverse effect on reliability. All of the 
testing was done in rooms familiar to the children. 


MEAN 


AND STANDARD Deviations For Each Group 


Post-EXPERIMENTAL 


RETEST CHANGE 


S.D. 


9.23 
5.67 
g.d1 


S.D. 


perimental period ended, all three groups received 
Form M of the Stanford-binet, with the attempt 
to hold test-administration rapport, in its usual sense, 
constant. The attempt, s» far as could be de- 
termined by the admittedly subjective criterion of 
the experimenter’s judgment, was successful. 


REsuts 

There are two sets of data: the quantita- 
tive results of the IQ testings and the behavior 
protocols written for each test and for each 
day of the experimental period. 

The quantitative results are summarized in 
Table 1. 

This summary shows, first of all, that each 
group, as a group, improved its perform- 
ance on the retest. In both A and B groups, 


TABLE 2 


LeveL oF SIGNIFICANCE OF INCREASE FoR Eacn Group 


Group t Pp 
A -71 <.01 


B -64 <.05 
Cc -49 <7 











35° 
the increase is statistically significant; in the 
control group, it is not. 

The standard deviations are not, on the 
whole, larger than those found in other stud- 
ies of young children (e.g., 2, p. 165). 

Secondly, Table 1 reveals that there are 
large differences between the mean increases 
of the different groups; two of them are sta- 
tistically significant, as shown in Table 3. 


TABLE 3 
SIGNIFICANCE OF DIFFERENCES 
MEANS 


Level BETWEEN 


S OF 


PAIRED 


Some of the children missed one or more 
days of school during the experimental 
period. In order to see if days of attendance 
was related to direction and magnitude of IQ 
changes, these were plotted graphically. In- 
spection indicated that no such relationship 
existed. 

We did, however, find a slight positive 
correlation (.38) between test-retest interval 
and absolute change in IQ. Although this is 
spuriously high relative to the correlation be- 
tween interval and IQ gain, the correlation is 
still positive and may indicate some slight 
connection between time interval and gain 
in IQ points. 

The other set of data is the behavior proto- 
cols. As was expected, most of the children 
in Group A proved to be more “relaxed” in 
the retest situation than they had been in the 
initial test. This showed up in their greater 
willingness to come to the testing room the 
second time, as well as in the! test behavior 
of some of them. The children in Group B 
were also more willing to come to the test- 
ing room for the retest than they had been 
the first time; this had not been expected. 
But these children did not, as a group, seem 
to enjoy themselves as much the second time. 
The expectation had been that the control 
group, too, would be more willing to come 
for retests than they had been initially. How- 
ever, this was not altogether the case; the one 
child who had to be coaxed to come for a 
retest was in this group, and a few children 
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seemed no more at ease than they had been 
the first time. 

The behavioral results of the experimental 
period were clear-cut; the relationship estab- 
lished in Group A was a very good one, as 
indicated by both quantity and quality of 
the actions initiated by the children and by 
their responses to the experimenter. The rela- 
tionship in Group B became one of seeming 
indifference on the part of the experimenter, 
responded to by actual indifference on the 
part of the children. 


Discussion AND CONCLUSIONS 


Our analysis of the quantitative data leads 
to the immediate conclusion that our results 
could not have been obtained through the 
operation of chance factors alone. This does 
not, however, entitle us to say that the results 
are exclusively due to manipulation of the 
experimental variable—i.e., the nature of the 
social relationship between child and tester. 
What about all the other factors examined 
by other researchers as influential in the de- 
termination of changes in IQ? 

First, it seems reasonable to rule out the 
organic factor as not relevant here. Second, 
randem variability is present; the standard 
deviations are large in all three groups. But, 
as has been pointed out, they are not, on the 
whole, larger than those found in other stud- 
ies of young children. Furthermore, studies 
which may be cited agree that the IQ is, for 
practical purposes, “sufficiently” constant for 
a group as a whole, although individual 
records may show large variations. Third, 
age level, mental level, and practice effect are 
the same for all three groups, and hence 
should not disturb comparability. Fourth, 
our results show a slight positive correlation 
between time interval between tests and gain 
in IQ; a correlation, however, gives no indi- 
cation as to which is the cause and which 
the effect. While we are not prepared to 
come to any final conclusion on this point, 
we would like to point out that Group A, 
which has the greatest mean increase in IQ, 
is second to Group B with regard to mean 
interval. Nor can regression effect be brought 
in to help explain our results. Inspection of 
the original data indicates clearly that there 
is no relationship between initial IQ and 
direction or magnitude of change. Environ- 
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mental effect can be ruled out, since there were 
no systematic changes in environment dur- 
ing the experimental period. This ensures 
reliability, but it does not guarantee validity. 
To judge the latter any extension of this re- 
search should include its duplication at dif- 
ferent socioeconomic levels. The “rapport” 
factors of emotional tension, motivation, and 
attitude (of subject and of examiner) may 
very well be important determinants of in- 
telligence-test results, but it is our contention 
that these factors are themselves aspects of 
the general social relationship between ex- 
aminer and child. 

There was one important variable which 
remained uncontrolled in this investigation: 
the bias of the examiner. To control this 
factor, it would have been necessary to use 
other examiners for the initial tests, and to 
devise some way of balancing the experimen- 
ter’s knowledge, in the retest, of the groups 
to which particular children belonged. This 
was not feasible under the circumstances; we 
can only state that we were aware of this 
possibility and hope that its effect, if any, was 
minimal. 

We are left, then, with two variables to 
consider. The first is that of familiarity with 
the tester. Both experimental groups in- 
creased significantly in IQ; the increase in 
the control group was far from significant. 

Other possible variables having been elim- 
inated, we shall maintain that heightened 
famiiiarity of the child with the examiner 
can produce a significant increase in IQ, 
while the gain accruing from mere retest 
will not be significant, at least under the 
conditions of this experiment. 

This conclusion is not in agreement with 
that reached by Marine (3). Note, however, 
that she did not conclude that familiarity 
did not make a difference; her position was 
that the difference was not statistically re- 
liable. We, on the other hand, have found 
the difference to be significant where Group 
A is concerned (see Table 3), while the con- 
fidence level of the difference between 
Groups B and C is at least suggestive of a 
true difference. 

It seems reasonable that familiarity with 
the examiner should produce an increment 
in IQ. Whether or not the relationship of 
the child and the tester is a “good” one, the 
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situation is a little more structured for the 
child who has some idea of what the exam- 
iner and the situation are like. Our allega- 
tion is that this added knowledge reduces the 
emotional tension felt by the child and there- 
by enables him to perform better on an IQ 
test. This conjecture is supported by the 
results reported by Yager (4) in his study of 
the influence of emotional tension on IQ test 
scores. 

The second remaining variable is the one 
whose effect we originally set out to test: 
the nature of the social relationship between 
examiner and child. Inspection of Table 
3 demonstrates that the difference between 
the mean increase of Group A and that of 
Group ” ‘s statistically significant at the .o2 
confidence level. Other possible variables 
having been eliminated, the conclusion seems 
justified that a good relationship between ex- 
aminer and child will produce a mean in- 
crease in IQ points which is significantly 
greater than the mean gain produced by a 
poor relationship. 

With the aid of this conclusion, we can ac- 
count for the effects of motivation, and of 
attitude of both subject and examiner on in- 
telligence-test performance, in the following 
way. If the relationship is a good one, it is 
more likely that the attitudes of the parti- 
cipants in the test situation will be favorable 
to one another and to the situation; this will 
allow favorable influences, such as motiva- 
tion, to operate. Now, if tester and child 
have a good relationship, the child will be 
motivated to perform as well as he can, for 
he will see the situation as one in which he 
is doing something for the liked person. The 
converse, of course, holds for the situation 
where the relationship is poor. 

The implication of our results for the 
testing of young children is obvious: unless 
the child is well acquainted with the exam- 
iner and has had enjoyable experiences with, 
and in the presence of, the examiner, his per- 
formance on a standard test of intelligence 
such as the Stanford-Binet is likely, under 
present conditions of standardization, to re- 
sult in an underestimation of his intelligence 
as measured by such a test. Therefore, to 
ensure standardization, either school psy- 
cholgists will have to devote far more time 
than they do at present to devloping ade- 
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quate social relationships with their prospec- 
tive subjects, or all persons doing IQ testing 
will have to limit their contacts with subjects 
to the actual testing situation. 

A further implication of this study is the 
need for further examination of the concept 
of rapport. If a good social relationship be- 
tween an examiner and a group of young 
children can produce so striking a mean gain 
in IQ, then rapport, as it has been conceived 
until now, is not enough. Either we must 
say that test-situation rapport plus a good re- 
lationship is necessary for a valid estimation 
of a child’s IQ, or we must extend our 
definition of rapport to include, as a requisite, 
a clearly defined social relationship between 
the examiner and the young child. Once the 
question of definition of rapport is settled, we 
will be in a better position to decide upon a 
standardized procedure for the testing of in- 
telligence. 


SUMMARY 


A survey of the literature on changes in 
intelligence indicated the importance of a 
study taking into account the hitherto un- 
controlled variable of rapport. 

Form L of the revised Stanford-Binet was 
given to three groups of ten three-year-old 
nursery-school children. Following that, the 
examiner established, over an experimental 
period of ten school days, a good relationship 
with Group A and a poor one with Group 
B. The third group, of comparable age and 
socioeconomic background, served as a con- 


trol. After the experimental period, all of the 
children were retested on Form M. 

There was a mean increase in IQ for all 
three groups, those of the experimental 
groups being statistically significant. Further- 
more, the differences between the mean gains 
of the groups were statistically significant be- 
tween Groups A and C, and A and B, while 
the difference between Groups B and C was 
at least suggestive of a true difference. 

The conclusions drawn from this were (a) 
that familiarity with the tester makes a dif- 
ference, and (4) that within this, the nature 
of the social relationship between child and 
examiner is important. 

The implication of these findings for the 
testing of intelligence is that, whether we 
define the testing situation as rapport plus 
a social relationship, or extend our definition 
of rapport to include this factor, this variable 
must be controlled in a consistent way if IQ 
scores of different subjects are to be com- 
parable. 

REFERENCES 

1. Canapy, H. G. The effect of “rapport” 
a new approach to the problem of racial psy 
chology. J. Negro Educ., 1936, 5, 209-219. 

2. Katz, Everyn. The constancy of the Stanford-Binet 
10 from 3 to § years. J. Psychol., 1941, 12, 159- 
® 

3. seat, Epirn L. The effect of familiarity with 
rhe examiner upon Stanford-Binet test performance. 
Teach. Coll. Contr. Educ., 1929, No. 381, pp. 


on the IQ: 


42. 
4: Yacer, J. L. The influence of emotional tension on 


of children. Amer. 


(Abstract) 


the intelligence test 
Psychologist, 1946, 1, 464. 


scores 


Received by this Journar, June 18, 1951. 





FACTORS UNDERLYING WECHSLER-BELLEVUE PERFORMANCE 
OF THREE NEUROPSYCHIATRIC GROUPS ** 


JACOB COHEN 


Bronx Ve 


HE past decade has witnessed the in- 

creasing utilization of the Wechsler- 

Bellevue Intelligence Scale in the psy- 
chological diagnosis of neuropsychiatric con- 
ditions. Much of the research which supports 
this practice has been of an empirical nature, 
i.e., the scores of groups of patients were com- 
pared with those of control groups on a subtest 
by subtest basis, and patterns were developed. 
Results to date have largely been negative 
and contradictory. This is partially a func- 
tion of poor research control, but the major 
problem has been the lack of a theoretical 
basis upon which to build. Objective evi- 
dence as to the nature of the psychological 
functions measured by the Wechsler-Bellevue 
is very scant. Wechsler (10) and Rapaport 
(7) have presented rationales for the test, 
but these were intuitively developed on the 
basis of clinical experience, and can only be 
considered hypotheses in serious need of ex- 
perimental testing. 

A contradiction becomes evident when the 
writings and practices of clinical psycho- 
diagnosticians are compared wth those of 
psychometric theorists, who have applied the 
powerful exploratory technique of factor 
analysis to their test materials. The former 
group tends to accept tests on their face va- 
lidity and to assume that a test measures 
whatever it does measure quite consistently. 
Thus, for example, in Rapaport’s(7) ra- 
tionale for the Wechsler-Bellevue, a func- 
tion is ascribed to a subtest without much 
regard for the possibility that the subtest 
may vary with respect to the extent and 
nature of what it is measuring as one goes 


1 Reviewed in the Veterans Administration and pub- 
lished with the approval of the Chief Medical Director. 
The statements and conclusions published by the author 
are the result of his own study and do not necessarily 
reflect the opinion or policy of the Veterans Adminis- 
tration. 

2 This article is based upon a doctoral dissertation 
submitted to the School of Education, New York Uni- 

Grateful acknowledgments are due Professors 
Ben-Avi, E. R. Wood, and J. D. Barnard for 
guidance. 


versity. 
A. H. 


their 


359 


terans Hospital and School of Education, New York University 


from one patient grouping to another. The 
theoretical orientation of the factor analysts, 
however, allows for this possibility. 

PURPOSE 

The purpose of the present investigation 
was threefold: 

1. To discover the nature 
underlying performance on 
Bellevue of psychoneurotic, 
and brain-damaged patients. 

2. To compare these factors among the 
groups. 

3. To develop a factorially based rationale 
for the subtests of the Wechsler-Bellevue in 
terms of these factors. This problem will be 
treated in a future paper (4). 


of the factors 
the Wechsler- 
schizophrenic, 


SuBJECTS 


The patients were all hospitalized white male 
veterans between the ages of twenty and forty.® 
On the basis of the final neuropsychiatric diagnosis, 
they were classified as psychoneurotic, schizophrenic, 
or brain-damaged. Cases with mixed diagnoses 
were discarded. Suitable cases were then eliminated 
from each group so as to achieve three groups of 
100 patients each which were approximately equated 
for Full Scale IQ, age, and number of years of 
education. The means and standard deviations of 
these variables for the three groups are presented 
in Table 1. 

Psychoneurotic Group. Of the 100 patients in this 
group, 34 were diagnosed as anxiety state, 25 as 
conversion hysteria, 14 as mixed psychoneurosis, 
and the remainder as miscellaneous subclassifications 
of psychoneurosis. 

Schizophrenic Group. This group contained 56 
paranoids, 12 simples, 7 hebephrenics, 4 catatonics, 
and 21 mixed or unclassified cases. This group can 
be characterized as early or acute schizophrenic, the 
“back wards” type not being represented in this 
sampling. 

Brain-Damaged Group. The 100 brain-damaged 
patients were made up of 66 focal cases, 20 cases 
of generalized damage, and 14 cases classified as 
focal-diffuse. All cases were of injuries or disease 
involving the cerebral cortex of postadolescent inci- 


8 The author wishes gratefully to acknowledge the 
cooperation of Dr. H. L. Flowers, Chief, Neuropsychiatric 
Service, and Dr. Robert S. Morrow, Chief Clinical 
Psychologist, of Bronx Veterans Administration Hospital. 
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TABLE 1 


IQ, Ace, 


EXPERIMENTAL 


Group MEAN 


aND YEARS OF EDUCATION OF THE EXPERIMENTAI 


Groups 


FE.DUCATION 
4s : 
MEAN SD 








Psych 
Schizophrenic 
Brain-Damaged 


neurotic 105.9 
104.8 


98.1 








dence. Precautions were taken that no brain- 


damaged patient be included the nature of whose 
injury was such as to tend to invalidate the test 
results, ¢.g., hemiparetics and aphasics. 


PROCEDURE AND RESULTS 


Variables. The raw subtest scores of the 
Wechsler-Bellevue Intelligence Scale were in- 
vestigated. Digits Forward and Digits Back- 
ward were treated separately, which resulted 
in twelve variables. 

Analysis. For each of the groups separately, 
Pearsonian r’s were computed among the 
twelve variables.* Each of the three resulting 
matrices of intercorrelations was factored by 
Thurstone’s complete centroid method (9, pp. 
161-170), using McNemar’s criterion (6) for 
the number of factors to be extracted. The 
application of this criterion indicated that 

4The matrices of intercorrelations are not presented 
here However, any correlation co- 
efficients can be reproduced from the centroid loadings 
(Tables 2, 3 and 4). The correlation between two 
variables is the sum of the products of their loadings 
on the three centroid factors. For example, the r 
between Arithmetic and Block Design in the Psychoneu- 
rotic group from Table 2 is (.70)(.76)+(.20)(—.34)+ 
(—.28)(—.07) which equals .48. Discrepancies be- 
tween th reproduced correlation coefhcients and 
those actually obtained will rarely exceed .02. 


to comserve space. 


TABLE 2 


CenTRoID MATRIX FOR THE PsycHoNEUROTIC GrouP 


II Ill h? 


TEst I 





I " -35 o8@ 
Cc -78 .19 .26 
DF . -07 37 
DB ‘ ‘ -32 
4 7 12 .28 
S . ° -07 
V ( 45 -17 
-02 
-37 
.07 
-23 
-23 


three factors could justifiably be extracted 
from each matrix. The centroid factors for 
the three groups are presented in Tables 2, 
3, and 4. Since centroid factors are not read- 
ily interpreted directly, the customary prac- 
tice of rotating the axes to oblique simple 
structure (8, pp. 222-223) and a positive mani- 
fold was followed. The technique used was 
Thurstone’s method of rotation by two- 
dimensional sections (9, pp. 194-216). The 
results of the oblique rotations are given in 


TABLE 3 


CENTROID MATRIX FOR THE SCHIZOPHRENIC GROUP 


Test I II h? 








TABLE 4 


CENTROID MATRIX FOR THE Bratin-DaMaceD Group 








Test 
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the first nine columns of Table 5. In this 
table, similarly designated factors from the 
three groups are placed together to facilitate 
interpretation and comparison between 
groups. Decimal points are omitted and 
significant loadings (over .30) are italicized. 

Since the rotations were made to follow the 
test vector configurations, the analysis did 
not preclude the possibility of the common 
factors being intercorrelated. Such was, in- 
deed, the case, and the correlation coefhi- 
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The Common Factors 


It is immediately apparent from Table 5 
that the common factors extracted in the 
three groups are strikingly similar. Such dif- 
ferences as are found are so minor that the 
factors must be identically named from 
group to group. Thus, despite major dif- 
ferences in the nature of the psychopathology 
of the three groups, the reference abilities dis- 
covered by factor analysis are essentially the 
same for the Wechsler-Bellevue. 


TABLE 5 


ComMon Factor AND Seconp-Orper Factor LoapINGs OF THE WECHSLER-BELLEVUE SUBTESTS IN 
PsycHONEUROTIC, SCHIZOPHRENIC AND Brain-DAMAGED Groups 











SECOND-ORDER 
Factor: G 
Psy Sch BrD 


Factor C 
DisTRACTIBILITY 
Psy Sch BrD 


Factor B 
NoNnVERBAL 
Sch 


Factor A 
VERBAL 


Sch BrD Psy BrD 





67 
47 


06 
—03 


63 
50 


—06 
13 


03 
—o8 


74 
71 


04 
Il 


61 
ee 


04 
—10 


—03 
—02 


—0g 
05 


04 
—I1 
18 
05 
oI 
42 
53 
60 
64 
15 


—07 
02 


—34 
03 
—13 
36 


33 
55 
66 
25 


44 
44 
71 
13 


39 
05 
i 
—02 


—07 


60 
62 


53 
64 


57 
59 
67 
69 
77 
57 
58 
65 
35 
62 
84 
45 
84 


00 54 


69 


51 
55 
II 
07 
o2 
—06 


51 
72 
61 
40 
53 
50 
61 
40 
57 
76 
64 
78 


57 
62 
57 
62 


61 
67 
49 
64 


68 
60 
83 


34 
—04 
—ol 
—II 


09 45 
03 
09 
60 


—32! 


14 
00 


44 


03 
05 
—04 
22 


Factor A 
Factor B 
Factor C 


33 
73 
83 
4o 





are as fol- 


cients among the rotated factors 
lows: 


rsp=.38, Tac=-70, Fa0=.38. 
Pas—-58, Tac—-66, Trac—.4I. 
Yan=-49, Tac—-59, Tac—=-50. 


Psychoneurotics: 

Schizophrenics: 

Brain-Damaged: 

The intercorrelation of the common factors 
made possible, in each group, the extraction 
of a second-order factor. This second-order 
factor accounts completely for the intercorre- 
lations of the rotated common factors. By 
means of methods given by Thurstone (9, pp. 
273-276, 421-434), the correlations between 
the second-order factor and both the com- 
mon factors and the twelve subtests were 
found for each of the experimental groups. 
These are given in the last three columns of 
Table 5, under the heading “Second-Order 
Factor: G.” 


Factor A: Verbal. This factor loads Vocab- 
ulary, Information, Comprehension, and 
Similarities with appreciably high magni- 
tude in all three groups. These subtests in- 
volve richness of vocabulary and efficient 
verbal-symbolic manipulation. Vocabulary is 
obviously a verbal test. Success on the Infor- 
mation test requires the acquisition and re- 
tention of information in verbal form. The 
Comprehension test measures verbal informa- 
tion concerning personal and social usage. 
One type of verbal-symbolic manipulation, 
“verbal concept-formation,” is measured by 
Similarities, and also vocabulary richness, for 
as Rapaport points out, “. . . most of the 
Similarities items may be responded to with 
well-established generic verbal terms. . .” 
(7, P- 197)- 

The Arithmetic and Picture Completion 
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tests also have significant (but low) loadings 
in two of the groups. This is not out of keep- 
ing with its interpretation as a verbal fac- 
tor. Arithmetic reasoning can involve much 
the same type of verbal-symbolic manipula- 
tion as do the obviously verbal tests. The 
problems are presented verbally and the 
cursive reasoning necessary for doing the 
problems uses word and number symbols. 
Picture Completion’s appearance in Factor A 
was not anticipated, but it does require infor- 
mation about the physical characteristics of 
common things. This information can be re- 
tained verbally, making apparent this test’s 
functional similarity to the Information test 
and accounting for its loading on this factor. 

Rapaport (7) has succeeded in isolating all 
the subtests loading the verbal factor in ail 
the groups by intuitive means and calls these 
the “essentially verbal subtests.” However, 
he fails to include Picture Completion and 
Arithmetic, which turn out to be verbal tests 
in two of the groups. 

It was, of course, anticipated that a verbal 
factor would emerge. This is “. .. one of the 
best-established restricted abilities ....” (3, p. 


408), and one of the earliest group factors to 


be admitted by the British school (2). The 
group factor v, as described by the latter, is 
apparently identical with the primary ability 
V as rediscovered by Thurstone using his 
multiple-factor method, and repeatedly veri- 
fied by other investigators (3, p. 408). 

In the only other published factor analysis 
of the Wechsler-Bellevue, Balinsky (1) also 
found the verbal factor to be of signal im- 
portance.® Of the six age groups studied, it 
appeared in all but one and was composed, 
with but slight exceptions, of the same sub- 
tests which load it in the present investiga- 
tion. 

Factor B: Nonverbal Organization. The 
tests which load this factor in all three groups 
are Object Assembly, Block Design, Picture 
Arrangement, and Picture Completion. 
These four tests: all involve the ability to or- 
ganize nonverbal material, visually perceived, 

5 Comparison of Balinsky’s findings with those of the 
must be made cautiously, for 
Firstly, his Wechsler-Bellevue battery 
was smaller than that studied here. Secondly, the 
conditions he set in his analysis were that his common 
factors be independent of each other (i.c., orthogonal) 
in contrast to the oblique and therefore correlated 
factors of the present investigation. 


present investigation 


two reasons. 


into meaningful wholes against a time limit. 
The nature of the organizational task differs 
from test to test, but all do involve the forma- 
tion of some kind of Gestalt, Object Assembly 
requires the construction of formboards. 
Block Design involves abstract spatial visual- 
ization in the organizational task. In Picture 
Arrangement, no spatial elements are in- 
volved, but a logical story with a point must 
be organized by placing cartoon panels into 
correct temporal sequence. Finally, Picture 
Completion requires that the subject supply 
a missing element in order to complete the 
Gestalt presented. 

Factor B was therefore interpreted as one 
involving the ability to quickly and mean- 
ingfully organize visually perceived, non- 
verbal material. 

Digit Symbol, the only other subtest which 
loads this factor, does so in only the Brain- 
Damaged group. This suggests that the 
nature of the task posed by this subtest differs 
from what it is in the other groups as a func- 
tion of the changed perceptual organization 
of cortically damaged subjects. 

It is noteworthy that Picture Completion 
appears in this factor as well as in Factor 
A, the first instance thus far encountered in 
this investigation of a “complex” test, i.e., one 
which measures more than one common fac- 
tor. The interpretation of scores from such 
subtests is very ambiguous, posing an all but 
insurmountable obstacle in the way of ob- 
jective pattern analysis (4). 

Factor B does not match previously estab- 
lished factors as readily as does the verbal 
factor. This is probably attributable to the 
fact that the Wechsler-Bellevue does not af- 
ford sufficient coverage of the intellectual 
domain as would be necessary to make com- 
parisons. Thus, Factor B was interpreted 
as requiring the ability to organize nonverbal 
perceptual material against a time limit. Al- 
though this factor operates as a functional 
unity in the Wechsler-Bellevue, it could prob- 
ably be further analyzed. A factorial study 
addressed to a wide variety of clinical per- 
formance tests has been presented by Heston 
(5). Utilizing tests of the same general type 
as those which comprise the Wechsler-Belle- 
vue Performance Scale, he found three fac- 
tors: visualizing spatial relationships, speed 
of movement, and the ability to grasp readily 
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relationships or configurations of a nonspatial 
nature. Our Factor B cuts across these finer 
distinctions, and is apparently ,primarily a 
composite of Heston’s first and third factors, 
with the second probably present to a lesser 
degree. 

Balinsky (1) also isolated factors similar to 
Factor B in all but one of his groups and 
called these “performance” factors. This in- 
terpretation seems inadequate in that it fails 
to isolate a psychological process common to 
the tests which load the factor. In any case, 
the naming of the factor is merely a matter 
of interpretive opinion—the important consid- 
eration is the fact that the factor, as defined 
by its test composition, is the same in both 
studies. 

Factor C: Freedom from Distractibility. 
Only two tests load this factor in all three 
groups, Digits Forward and Digits Back- 
ward. To facilitate interpretation, Digit 
Symbol and Arithmetic, which load this fac- 
tor in two of the groups, will also be con- 
sidered. These four tests in common require 
an alert, undistracted attention for good per- 
formance. In all of them, if one of the ele- 
ments to be mentally manipulated by the sub- 


ject does not “register” when presented by the 
examiner, or is “lost” in the course of manip- 
ulation, the subject can not make up this 


loss and is penalized in the scoring. Thus 
the naming of Factor C as Freedom from 
Distractibility. 

The above analysis of Factor C accords 
most obviously with the nature of Digits For- 
ward and Digits Backward. With regard to 
the Arithmetic test, if distractibility causes 
a loss in one of the numerical or relational 
elements of the problem, either a complete 
loss or a loss in time for repetition tends 
to result in a lowered score. In addition, dis- 
tractibility can act to interfere with orderly 
arithmetic manipulation. If constant losses 
of digit-symbol associations take place, or if 
their original learning is interfered with, per- 
formance must be slowed and score reduced. 
Rapaport (7, p. 289) points out that perform- 
ance on this test is of an “attention” charac- 
ter, which one would therefore expect to be 
heavily dependent upon (if not identical 
with) freedom from distractibility. 

The only other loading of Factor C is that 
of Block Design in the Psychoneurotic group. 
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This test does not lend itself as readily to 
interpretation as do the others. A tentative 
explanation of its appearance on Factor C 
is offered: It is an exceedingly complex test 
in which the relationships among the block 
faces are numerous, and many possibilities are 
presented with each reorientation of each 
block. In such a situation, distractibility can 
readily have deleterious effects by way of 
overlooking possibilities and failing to see 
correct part-integrations even when hit upon. 
The test’s sensitivity to the effects of dis- 
tractibility is heightened by the importance 
of speed, since time bonuses and time limits 
contribute heavily to the test’s variance. 

Factor C seems of a different kind from 
the other two. This difference is most read- 
ily summed up in its consideration as a cona- 
tive factor. Because it is of a more pervasive 
nature than the other two factors, it would 
be expected to play a rather important role 
in general intellectual functioning. This ex- 
pectation is borne out later in a discussion of 
the second-order factor. 

The specific characteristics of the patients 
which give rise to Factor C deserve some 
speculative consideration. All three experi- 
mental groups are subject to the manifesta- 
tion of overt anxiety, either characterological, 
reactive, or both. Psychoneurotics vary con- 
siderably with regard to overt anxiety, as do 
early schizophrenics. In the latter group, 
contact with reality in addition to overt 
anxiety probably is an important source of the 
variance picked up by Factor C. In the 
Brain-Damaged group, in addition to reactive 
anxiety, the variability with regard to extent 
and severity of structural damage probably 
produces Factor C; ready distractibility is a 
long recognized symptom of brain damage. 

Balinsky (1) identifies no factor of Free- 
dom from Distractibility, but in comparable 
age groups finds a factor (orthogonal), sim- 
ilar in composition to Factor C, which he in- 
terprets as being a memory factor. The 
discrepancy may well be solely on the inter- 
pretive level, since freedom from distractibility 
is a necessary prerequisite to rote memoriza- 
tion, which is involved in the most highly 
loading tests. Another consideration is the 
difference in the populations studied: neuro- 
psychiatric patients are presumably more 
variable with regard to distractibility than 





304 Jacos CoHEN 


are normals, resulting in a possible spread of 
the factor from the rote memorization tests to 
the Arithmetic and Block Design tests. 
Finally, it is necessary to reiterate the cau- 
tion that Balinsky’s rotations were orthog- 
onal, making his results not strictly com- 
parable with ours. 

Discussion. It has been pointed out that 
some of the subtests load a given factor in 
some group and fail to do so in some other. 
This is suggestive of certain basic differences 
in the nature of intellectual functioning un- 
der varying conditions of psychopathology. 
That the similarity is by far greater than the 
differences has already been noted, but the 
differences cannot be gainsaid. A discussion 
of these would be too speculative for inclu- 
sion here; they are merely noted for their 
possible value in suggesting hypotheses to 
other investigators. 

A more immediately useful application of 
these differences to clinical practice is how- 
ever apparent. Since a given subtest need not 
measure the same factor from one neuro- 
psychiatric group to another, or does so with 
varying validity, it becomes necessary to 
know a subject’s diagnosis in order to inter- 
pret his scores. This is of course quite at 
variance with the use to which the typical 
clinician tries to put the Wechsler-Bellevue. 

Another difficulty in test interpretation re- 
sides in the fact that some of the subtests are 
complex in one or more groups. To exem- 
plify this problem, how is one to account 
for a low Arithmetic score in a brain-dam- 
aged patient? Is it due to impaired verbal 
ability or to distractibility? Reference to the 
other subtests may be helpful, but the prob- 
lem poses many difficulties, particularly when 
the diagnosis is not known. 


The Second-Order Factor 

Because of the appreciably high intercorre- 
lations of the common factors, the second- 
order factors which were extracted proved, 
in general, to be moderately or highly re- 
lated to the subtests and common factors. 
These correlations are presented in the last 
three columns of Table 5, each column rep- 
resenting one of the experimental groups. It 
is apparent from the magnitude of these cor- 
relations both that this is a general factor and 
that it is strongly operative in the Wechsler- 
Bellevue as a whole. 


It might be more proper to speak of gen- 
eral factors than of a general factor. Al- 
though the groups are quite similar with 
regard to the first-order common factors, 
there are noteworthy differences in the rela- 
tionship of the subtests and common factors 
to the general factors from group to group. 
With regard to the subtests, Table 5 shows, 
for example, that the most valid measure of 
the general factor for the schizophrenics is 
Block Design, and for the brain-damaged 
patients it is Arithmetic. Vocabulary is the 
best measure of the general factor for the 
psychoneurotics (.77), but is the poorest for 
the brain-damaged patients (.40). This find- 
ing is strongly consonant with the Babcock 
hypothesis and will be treated at length in 
a following article (4). There are also differ- 
ences among the groups in the correlations 
of the common factors with the general 
factor. 

To account for these discrepancies requires 
an adequate conceptualization of the gen- 
eral factors found in the three groups. To 
think of them as factors of general intelli- 
gence is not enough—the term carries with 
it a connotation of innateness which is un- 
tenable in view of the differences among the 
groups with regard to it. Secondly, the im- 
plication that a general factor exists independ- 
ently of the tests whose intercorrelations 
define it is also rejected here. If the second- 
order factor is conceived as one of present 
general intellectual functioning on the 
Wechsler-Bellevue (and symbolized as G in 
contrast to Spearman’s g), the difficulty is 
resolved : 

1. On a priori and clinical grounds, the 
Wechsler-Bellevue is a measure of present 
functioning (7, p. 37). Granting the influence 
of hereditary factors on intelligence, any 
given performance on an intelligence test is 
of necessity representative of what the sub- 
ject can do at the time the test is given 
and must therefore depend on his physical 
and psychological state at that time. 

2. As noted, the subtests and common fac- 
tors are not related to G to the same degree 
in the different groups. In a very real sense, 
the second-order general factors are different 
in the three groups in so far as present gen- 
eral intellectual functioning is a different 
phenomenon, a different end-product with 
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different antecedents, in groups of varying 
psychopathology. 

The additional entries to the right at the 
foot of Table 5 are the correlations between 
the common factors and G. In general, Free- 
dom from Distractibility is the most closely 
related to G, with the Verbal factor a close 
second and Nonverbal Organization a poor 
third. That Freedom from Distractibility 
turns out to be a demonstrably important in- 
gredient of present general intellectual func- 
tioning is not surprising when one considers 
that the ability to attend to a task of any 
nature is prerequisite to its efficient accom- 
plishment. It may seem paradoxical that 
when three factors are extracted from an in- 
telligence test, a conative factor appears and 
is the most closely related to the general 
factor. The paradox is dispelled when the 
artificiality with regard to organismic func- 
tioning of the cognitive-conative distinction 
is once appreciated. 


SUMMARY AND CONCLUSIONS 


The Wechsler-Bellevue subtest scores of 
groups of psychoneurotic, schizophrenic, and 
brain-damaged patients were separately fac- 


tor-analyzed by the centroid method with 
oblique rotation to simple structure and a 
positive manifold. This resulted in three 
correlated common factors in each group. 
The common factor intercorrelations were 
then factored and a second-order factor was 
extracted in each group. The major conclu- 
sions which emerge are: 

1. Although there are minor differences 
in factor loadings among the groups, from a 
factorial viewpoint, the same three functional 
unities, named Verbal, Nonverbal Organiza- 
tion, and Freedom from Distractibility, un- 


derlie Wechsler-Bellevue functioning in 
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psychoneurotic, schizophrenic, and_brain- 
damaged groups. 

2. A factor of present general intellectual 
functioning (G) is present in all the groups 
but differs from group to group in its rela- 
tionships with the common factors and sub- 
tests. Freedom from Distractibility and 
Verbal ability were more closely related to 
the general factor than Nonverbal Organiza- 
tion. 

3. Wechsler-Bellevue subtests do not al- 
ways measure the same factor(s) consistently 
in different types of patients, or do so with 
varying validity. It is therefore necessary 
to know a patient’s diagnosis in order to 
know the common factor functions these 
tests are measuring in him. Unfortunately, 
it is commonly for the purpose of ascertain- 
ing diagnosis that the Wechsler-Bellevue is 
given. 
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ENERALLY speaking, contributions to the 


literature on the connection between 
the psychology of learning and of psy- 
chotherapy have been relatively few in num- 
ber. Typically, the contributions which have 
been made have been reductionist in nature; 
alleged phenomena of psychotherapy have 


been reduced to comprehensibility in terms 
of hypothetical processes of learning, the 
mechanisms through which learning occurs. 
In contrast to such approaches, the approach 
taken here will revolve around an hypothesis 
regarding the nature of personal-social inter- 
action which will be considered in the light 
of phenomena of learning. The hypothesis 
may be stated as follows: 


In two-way personal-social interaction each indi- 
vidual concerned displays anticipatory behavior 
which in turn elicits anticipatory behavior on the 
part of the other, the anticipatory behaivor of 
the other in each case confirming the prior learn- 
ings of the individual about himself. 


In order to consider the role and behavior 
of the therapist in relation to the hypothesis 
it will first be necessary to make some state- 
ments regarding certain characteristics of 
human learning, conditions of social learn- 
ing, and the nature of the psychotherapeutic 
situation itself. 


Some CHARACTERISTICS AND CONDITIONS OF 
LEARNING 


In considering relevant research on hu- 
man learning we find, first of all, that 
the first learnings of human beings are 
acquired at a much slower rate and at 
a more permanent level than is often 
thought (7). For example, in the percep- 
tion of figures it has been found in case 
after case that congenitally blind adult pa- 
tients upon becoming sighted take a month 
at the minimum to approximate the form 
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and color: perception of a normal person in 
learning to recognize the property of square- 
ness, or to discriminate between such simple 
figures as squares and triangles. The dis- 
crimination then breaks down when colors 
or positions are changed (21). Usually, 
fully normal color-form perception is at- 
tained by such patients only after months 
or even years of practice. The clinical ob- 
servation and testing of these patients sug- 
gest that even the ability to perceive simple 
figures is a product of almost endlessly re- 
peated stimulus situations. These results have 
been confirmed with chimpanzees and indi- 
cate that these animals, if reared in darkness, 
later show almost complete inability for a 
very long time to discriminate simple figures 
that normally-reared animals at the same age 
could easily discriminate (6). These animals 
were tested in light under conditions of 
hunger motivation. In spite of the hunger, 
they were unable to perceive what was re- 
quired of them. This finding appears to be 
unequivocal; the first learnings of the higher 
animals, including man, tend to be slow, and 
simple tasks which are encompassed by the 
ability of both man and the lower animals 
are learned as fast (or faster) by the lower 
animals. Thus, Hebb (6) found that rats 
could learn to make certain visual discrimina- 
tions in less than an hour of total visual ex- 
perience and that primates (including man) 
required weeks and months to learn the 
same discriminations. It is also well known 
that when rats and human beings are re- 
quired to run certain types of mazes, identical 
in design, the rats do about as well as the 
humans. It is on the basis of this and other 
evidence that Hebb has come to the conclusion 
that simple relationships are learned almost 
as well by the lower animals as by human 
beings and that first learnings are faster in 
the lower. than in the higher animals. I 
understand the second conclusion to mean 
that more stimulation is required in human 
beings than in lower animals to reach the 
same level of discrimination. 
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Yet another inference comes out of com- 
parative studies of learning capacity. As 
we consider the phylogenetic scale we find 
that instinct, which plays a prominent role 
in many lower species, becomes less and less 
prominent in the higher levels of the scale. 
When we come to man, we find an organism 
in which it seems that instincts are not present 
—although this is still a controversial issue, 
there is no evidence that they are present. 
It seems fair to say that, relative to most 
animals, man is instinct-free. Associated with 
this persistive tendency for instincts to play 
a lesser and lesser role as one goes up the 
phylogenetic scale, we find an increasing 
ability to grasp complex relationships. It 
is certain that this ability reaches its height 
in man, and it is probable that the difference 
between a human being with an I.Q. of 100 
and a chimpanzee is greater than the differ- 
ence between a chimpanzee and a sea anem- 
one insofar as intelligence is concerned. In 
other words, there is a real discontinuity 
between human beings and other animals. 
The relatively enormous ability of man to 
handle complex relationships is probably oc- 
casioned by his language capacities. Asso- 
ciated with this, however, is slower first 
learning: it takes longer to build a complex 
machine than it does to build a simple one. 

There is another characteristic of the first 
learnings of the individual that has been 
called “intractability” by Hebb. This term 
refers to the effect of infant learning upon 
mature learning. For example, he has shown 
that the learning behavior of the rat blinded 
in infancy is different from that of the rat 
blinded at maturity. Modes of solution 
learned by the rat with the aid of vision may 
be utilized even though the animal is later 
blinded; apparently the early blinded rat 
is not able to acquire such modes of attack. 
Hunt (9) has shown the definite effects of 
early hunger upon later hoarding behavior 
in the rat and much the same type of effect 
has been shown by Lorenz (12) with birds. 
The conclusion is that later learning is pyra- 
mided on early learning and is subordinate 
to it in critical aspects. It is to be expected, 
although it has not been proved, that early 
learning, by virtue of the greater number 
of repetitions required, should be even more 
intractable in man than in the lower animals. 


367 

This effect, even though it may be con- 
sidered an example of iransfer of training, 
should perhaps be considered more exten- 
sively. For example, if early learning is both 
slow and permanent and dominates later 
learning, since even relatively simple visual 
perceptions must be acquired, and often pain- 
fully, then we infer that the course of de- 
velopment may be radically affected by the 
early stimuli conditions which determine the 
early learnings. With certain perceptual 
deprivations the individual may not acquire 
what are regarded as simple perceptual skills, 
and thus may not be able to deal with situa- 
tions in which the ability to handle complex 
relationships is crucial. Obviously, if triangles 
cannot be perceived it is more difficult to 
learn geometry. Thus, we may suffer a 
functional loss in one of the primary mental 
abilities solely through an early perceptual 
deprivation. It is obvious that any such de- 
privation may have serious consequences on 
the course of development of personality 
characteristics. As has been shown so well 
with brain injured patients, individuals can 
and do react to their own characteristics, in- 
cluding their own deficiencies. Many of 
these reactions may of course be learned from 
other persons. 

We can consider the effect of early ex- 
periences upon learning in a psychothera- 
peutic situation as a case of transfer of train- 
ingy. Transfer is defined as the influence 
of prior experience (or antecedent learning) 
upon present experience (actual present learn- 
ing). In this case, it should be recognized that 
from the standpoint of transfer of training, 
later social learnings are not essentially differ- 
ent from the earlier; that, in fact, they are con- 
sistent with and derived from them. It is 
in this sense that we are always dealing with 
the earlier learnings and responses inherent in 
the present behavior of the client. 

Let us consider some of the phenomena 
subsumed under transfer of training. One 
phenomenon is generalization. Each new 
stimulating condition tends to elicit responses 
similar to those produced in similar stimulat- 
ing conditions in the past. Similarity here 
covers identical stimulating conditions, re- 
semblance without identity, and transposition 
in which stimuli having the same inner rela- 
tions but not the same magnitude are pre- 
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sented. It is of interest that in transposition 


situations as the stimulus variation increases, 
verbal methods of attack become more and 
more superior to motor methods; a learned 


response to a given stimulus is not necessarily 
directly coordinate to that stimulus. Also, 
if an organism has a response in its behavior 
repertory, then learning to make that response 
to a new stimulus favors positive transfer. 
On the other hand, learning to make a new 
response to an old stimulus results in negative 
transfer, the degree of interference being a 
function of the amount of similarity becween 
stimuli. 

With respect to the transfer of responses, 
or response generalization, the facts may be 
stated as follows: responses other than the 
original learned response may be elicited by 
a given stimulus situation. Thus transfer 
of response has been demonstrated from one 
hand to the other in tapping, pursuit-rotor 
learning, manual maze running, mirror draw- 
ing, etc., and from one activity of a limb to 
another activity of that limb. That is, a 
skill learned one way may be performed 
another way: I can write with my pen in 
my fist and do a fair job as compared to my 
usual mode of handwriting. 

One of the most characteristic phenomena 
of human learning is the unspecific learnings 
occurring during the learning process which 
have been called variously, sets, methods of 
solution, and principles. As McGeoch (13) 
points out, each of these terms implies the 
other. In any event, these terms refer to 
the fact that learnings occur that are not a 
part of the learning task. A _ significant 
feature of such learnings is that the subject 
may or may not be able to report on them, 
although he may exhibit them in a later situa- 
tion. It is also true that such factors are more 
noticeable in adults than in young children. 
This is probably a consequence of the ap- 
parent fact that learning in the adult is 
primarily perceptual. The necessary motor 
skills are already in the repertory of the adult, 
whereas the child must learn them. It might 
be added that the learning of general princi- 
ples, attitudes, sets, etc., point to the existence 
of central neural factors which tend to de- 
termine the direction of activity and to sus- 
tain it in one particular direction. 

These central factors are learned in addi- 
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tion to the task set before the learner, and in 
terms of social learnings what is learned in 
social situations is much more than the 
stimuli “attended to.” Another consequence 
of the learning of general principles, sets, 
and attitudes is that they determine selectivity 
of perception as well as of response. This 
becomes obvious as soon as one considers 
that a social act (or any other behavior) is 
not necessarily a consequence of environ- 
mental stimulation but is often self-initiating. 
Transfer of training thus refers not only to 
what the individual does, but also to what 
he perceives; to the carrying over of single 
responses, or to the carrying over of sets 
and principles. It is this latter aspect of 
transfer that makes it so hugely important 
in human affairs. As McGeoch (13) states: 
“It (transfer) is a basic factor in originality, 
the original and creative person having, 
among other things, unusual senstitivity to 
the applicability of the already known to 
new problem situations. Perceiving, at what- 
ever level, is probably never free of its in- 
fluence; and there is no psychological com- 
plex event which is not a function of it.” 


Aspects OF PERSONALITY DEVELOPMENT 


What are the consequences of these charac- 
teristics of learning and how do they affect 
the development of the personality? Con- 
sideration of the social conditions culturally 
imposed upon children is relevant here. Al- 
though child-rearing practices vary greatly 
from culture to culture, from care by one 
or two caretakers (parents) to care by several, 
it seems that young children are reared in a 
relatively constant psychological environment. 
That is, the social behavior of caretakers with 
respect to their charges is fairly constant or 
uniform (or restricted in terms of the range 
of available responses). Thus recurring pat- 
terns of stimulation are presented to the 
child. Of course these behaviors may have 
complex origins and they may also be be- 
wildering to the child, but they must have 
constant characteristics if the knowledge we 
have about adult personality is valid. Thus, 


2 Everyday experience indicates that many differentially 
reacting caretakers would be a disastrous circumstance. 
With many caretakers, the child may learn that the 

world is unpredictable, that his own behavior 
bring unforeseen results. We get the kind of 
usually attributed to “inconsistent” parents. 
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the stage is set for the child to learn what 
he is like from the repeated and recurring 
behavior patterns presented to him by his 
caretakers. This is one of the main sources 
of learning for the child. 

Take, for instance, the situation of the 
neonate with respect to social learning. He 
is helpless. About the only behavior in his 
repertory consists of reflexes such as sucking 
and the characteristic gross bodily responses so 
widely observed. He cries, thrashes about, 
and generally makes a commotion which 
brings responses from his caretakers. It is 
noticeable that, in general, the behavior of 
parents tends overwhelmingly to be in re- 
sponse to infants. ‘The proportion of “spon- 
taneous” approach behavior to infants and 
children tends to be small. This statement 
is based on the observation of mothers with 
new-born children. The fact that most adult 
caretakers have functions other than child- 
rearing makes the statement seem probable 
to the writer. Such a situation is calculated 
to induce the child to learn several things 
which may be summarized as follows: (a) 
“The responses of others to me occur as a 
result of my responses,” (4) “I control the 
behavior of others,” (c) “I am what the 
response of others to me are.” 

It should be emphasized that these and 
similar learnings occur as first learnings with 
respect to social situations, but they are 
learned slowly over hundreds and even thou- 
sands of repetitions, and as first learnings they 
are intractable. Reasoning from the circum- 
stances confronted by the child in response 
to his own behavior, it appears that every 
child must learn about himself as a social 
object just as he learns gradually that he is 
the origin of the coordinates up-down, front- 
behind, right-left, and past-future. He learns 
about himself as a physical object from such 
experiences as scratching his own face and 
from experiences with generally non-reacting 
objects such as his crib, his toys, his bottle, 
etc. He learns about himself as a social ob- 
ject primarily from the responses of his 
caretakers to himself. There is evidence that 
what may be called the “self as physical ob- 
ject” and the “self as social object” are 
originally separate and merge only slowly. 
Generally, two-year-old children are quite 
adept at locating themselves in space if not 
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in time. However, if social selfhood is 
measured by the use of personal pronouns, 
we find that children still have difficulty up 
to the age of about four years. 

While learning what they are like from 
the responses of their caretakers to their own 
behavior, infants and children are also learn- 
ing to respond to other persons rather than 
merely to behave on the basis of hunger, 
thirst, bladder pressure, skin temperature, 
etc. These social responses must, of course, 
depend also upon differentiation and control 
of the facial, pharyngeal, and other skeletal 
muscles, but as control is increasingly es- 
tablished, the responsive behavior of the child 
increases in proportion to the behaviors to 
which his caretakers in turn respond. In 
other words, social situations constitute a 
large part of the child’s environment. This 
can serve only to confirm the original learn- 
ing that the responses of others to him are 
attributable to his own characteristics. This 
situation is, in the writer’s opinion, the basic 
situation of social learning. Let us sum- 
marize the situation in this way: (a) From 
the responses of others to his own behaviors, 
the child necessarily learns what he is like 
as a social object. (4) As he begins to gain 
muscular control, the child himself responds 
to others on the basis of anticipations of their 
reactions; his perception of himself as a 
social object; and his own responses which 
evoke responses from his caretakers, these 
responses being a reflection of the behavior 
of the caretaker and the behavior of the child. 

This formulation implies that the course of 
personality development is highly dependent 
upon the nature of early social learnings; 
that the child early becomes autonomous; 
that his course of development, while de- 
termined and dominated by early experiences, 
begins relatively early to revelve around 
those experiences rather than around passive 
reception of social stimuli. The child ac- 
tively creates a substantial part of his social 
environment and begins to determine what 
social stimuli are received. The domination, 
so to speak, of the social environment through 
the display of anticipatory behavior which 
derives from earlier learnings leads to con- 
firmation of the earlier learnings about self. 
In other words, as they grow older human 
beings become more and more like them- 
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selves, more consistent. The social learnings 
accumulate to determine subsequent learn- 
ings about self. The individual develops a 
life-style, an approach to personal-social ex- 
perience firmly based on and congruent with 
earlier experiences. 


Tue BEHAVIOR OF THE THERAPIST 

Considering the described characteristics of 
human learning, the stimulus conditions to 
which most, if not all, children are subject, 
and the creation of actual social realities by 
way of the basic conditions of social inter- 
action, the preceding discussion implies that 
tl 


ganism 


1¢ psychotherapist is confronted by an or- 
whose quite 
fixed. It follows that if the client is to change 


basic organization is 
the on-going course of his development in 
any important respect, the therapist must so 
regulate his own behavior that those factors 
in the client which lead to the creation of 
personal-social, interactional situations con- 
firming 
guished. 


existing self-learnings extin- 
And must such a 
fashion that the client is in a position to per- 
ceive himself differently as a social object 
and to realize the role of his own anticipatory 
behavior in evoking responses from others. 
The effect of this self-regulation of behavior 
by the therapist is the establishment of a 
communication situation in which the existing 
perceptual, conceptual, and motivational or- 
ganizations may be reorganized in a fashion 
which permits the client to develop habits of 
adjustment which do not give rise to new 
tensions, self-rejection, guilt, etc.’ 

The importance of the problem of com- 
munication stems from the fact that, among 


are 


he do so in 


other things, the behavior of the therapist is 


designed to facilitate production of signs by 
the client, signs which the client will pro- 
ceed to interpret as being expressive of him- 
self.‘ To the that the verbal 
nonverbal behavior of the counselor does not 


have the same sign value for the client as it 


extent and 


>Symptoms will 


regardless of the 
that all of 
may be 


However, 
the 
had 
self-rejecting. 
the same 


not be discussed here. 
writer's 
attitudes 

They 


way they 


defenses involved, it is 
his 
summarized as 
themselves in 
others. 


experience clients have 


which 
tend to condemn 
would condemn 
4A 
sign 


of himself. 


self-cxpressive if the 
interprets the 


sign is producer of that 


fact of its production as expressive 
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does for the counselor, communication breaks 
down or does not eventuate. It seems that 
under such conditions production by the client 
of self-expressive signs would not be facili- 
tated. It is not held that the production of 
self-expressive signs is necessarily of thera- 
peutic value; it is held that such productions 
are necessary conditions to reorganization on 
the part of the cilent, because the interpreta- 
tion of signs by the organism producing them 
is in the nature of self communication, and 
one of the end products of such a process 
is a judgment or valuation of self-produced 
stimuli. By producing such stimuli, the client 
is in a position to signify and so to discover 
the factors in himself which led to the pro- 
duction of the stimuli. 

If the verbal and nonverbal aspects of the 
counselor’s behavior are incongruent, that 
is, if they express different attitudes, it is 
apparent that this inconsistent behavior of 
the counselor may have consequences for 
the client.® Much social is con- 
ventionally designed to inhibit the expression 
undesirable 
Therefore, it is likely that the nonverbal be- 
havior will be taken as expressive of the 
“real” attitude of the therapist toward the 
client. Inasmuch as it is often impossible 
to know what the nonverbal behavior de- 
notes, we have a “projective” situation in 
which a client may take the counselor’s be- 
havior as indicative of something about him- 
self or as a rejecting or evaluating attitude. 
To rephrase this in terms of transfer is to 
say that stimulus generalization occurs, the 
very ambiguity of the stimuli situation con- 
tributing to the spread of generalization. Old 
responses and attitudes are called directly 
into play, and the counselor is perceived and 
responded to in terms of the earlier learnings 
of the individual about himself and about 
others. Such behavior on the part of the 
therapist breaks down communication because 
it results in responses to the nonverbal rather 


than the verbal aspects of the counselor’s 


behavior 


of various socially behaviors. 


behavior, the significations of the behaviors 
being simultaneous and contradictory and the 
nonverbal behavior having prepotent sign 

5If he 
sistency as such. 


perceive the inc 
j 
evaluation. 


is lucky, the client will perceive the incon- 
More often, however, the client will 


ynsistency as a and concealed 


unitary 
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values for obvious reasonis.© Whatever the 
therapist wants the client to learn may be 
inhibited by the response which his own be- 
havior has called out in the client by way 
of stimulus generalization. The client has 
something new to learn, but the desired learn- 
ing is interfered with because stimuli similar 
to old stimuli are present and evoke responses 
incompatible with the responses to be learned. 
That the responses are so strong and preva- 
lent can be understood in terms of the char- 
acteristics and conditions of early social learn- 
ings as discussed earlier and in terms of 
subsequent social learnings consistent with 
them. However, the problem discussed above 
is only one instance of the manner in which 
the therapist as a stimulus or set of stimuli 
may induce negative’ transfer with respect 
to social learnings. 

The generalization of interpersonal stimuli 
would seem to be, from the standpoint of 
human affairs, the most important case of 
transfer of training. When the client comes 
to the psychotherapeutic situation he has a 
variety of responses in his behavior repertory. 
He has a social history, which is the equiva- 
lent of stating that the generalization of both 
responses and stimuli has become very great. 
Another way of stating this is that the client 
has symptoms and characteristic ways of be- 
having which, objectively at least, seem far 
removed from the original conditions of 
learning. Any stimulus, behavioral or other- 
wise, coming from the therapist may evoke 
responses which are related to the client’s 
maladjustment. Indeed, this is almost in- 
evitable considering the sets of individuals 
seeking psychotherapy. 

As an example, the counselor may resemble 
one of the parents of the client. If this be so, 
then certain responses will tend, by way of 
stimulus generalization, to be called out in- 
stead of others. In terms of the intention and 
purposes of the therapist, these responses of 
the client may or may not be realistic; the 

6One highly important aspect of many psycho- 
therapeutic situations seems to be that the client is 
consistently given the opportunity to finish a sentence 
without at the same time preparing rebuttals. This 
novel situation allows the client to stick to the psy 
chological point. 

7In speaking of positive and negative transfer, we 
refer to the conditions and results of experiments such 


as Yum’s (27), not of transference in the psychoana- 


lytic sense. 
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client is not responding to the behavior of 
the therapist as the therapist himself would 
respond (or imagines he would respond). 
Take the case of an individual who has been 
treated in a consistently rejecting fashion by 
parents. The client’s earliest social learnings 
may well be that he is an undesirable object 
and this may later be conceptualized in 
terms of “negative” attributes such as stu- 
pidity, ugliness, laziness, aggressiveness, in- 
feriority, and so forth. The social behavior 
will be anticipatory, and thus self-concepts 
will be confirmed or reinforced rather than 
denied or extinguished, since these anticipa- 
tory behaviors will be responded to, by and 
large, in such a manner as to confirm them.* 
When the therapist is met, these anticipations 
may be evoked by the stimuli presented by 
the therapist, whether they be stimuli relat- 
ing to appearance or stimuli stemming from 
the behavior of the therapist himself. If 
the stimuli of appearance generate responses, 
then the behavior may reinforce or extinguish 
them. This point is of crucial importance. 
If the behavior of the therapist is like that 
of authority figures important in the client’s 
development, then it is to be expected that 
the client with a “rejection complex” will 
learn with difficulty what he needs to learn 
from the therapeutic situation. 

Such a situation conforms to the paradigm 
of associative inhibition, and one would thus 
expect interference with the new learning. 
From the standpoint of the therapist, the 
client’s behavior may be described as follows: 


When I undertake to cure a client of his symp- 
toms, he imposes against me a vigorous and tena- 
cious resistance all through the course of therapy. 
This is really extraordinary to think that the 
client, whose symptoms cause him and even those 
about him such suffering, who is willing to make 
such sacrifices in time, money, effort, and seif- 
conquest to be freed from them, should in the 
interests of his maladjustment resist the help 
offered him. How improbable this is. 


The reader may recognize this as a para- 
phrase of Freud’s passage on resistance (2). 
Obviously, the client comes to be regarded 
as being even more unreasonable if his in- 


8 This is one reason why the nondirective therapist 
attempts to stay in the “internal frame of reference of 
the client.” Striving to understand the client in his 
own terms and to demonstrate that understanding 
tends to extinguish many of the anticipatory responses 
of the client. 
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ability to learn persists in the face of a great 


“ 


deal of “accepting,” “permissive,” and “re- 
assuring” behavior on the part of therapist; 
there is thus a danger that he will be misled 
into a plausible, simplistic error of analysis 
of the dynamics of this dramatic situction. 
We might conclude that the client wants to 
retain his neurosis. 
Let us again paraphrase Freud: 


In what way can we now account for this observed 
fact, that the client struggles so energetically against 
the relief of his symptoms in the restoration of his 
mental processes to normal functioning? Well, 
we know that the client has, with a great deal of 
effort, prevented a mental process from becoming 
conscious. The same effort is again at work during 
psychotherapy, the effort at this point being di- 
rected against the therapist in order that the uncon- 
scious will not be brought into consciousness. This 
is resistance and the process which is demonstrated 


by resistance is repression. 


From the standpoint of the empirical facts 
and the probable conditions of social learn- 
ings as discussed above, however, we infer 
that the negative transfer is involved as a 
result of the behavior (and perhaps the ap- 
pearance) of the therapist. The client has 
been constructing social realities for himself 
which confirm his early and succeeding social 
learnings and indeed determine what social 
learnings will occur. In the interpretation 
here offered, resistance is a function of nega- 
tive transfer, which arises from the presenta- 
tion of identical or similar stimuli conditions 
that tend to evoke old responses that interfere 
with new learning. 

In this interpretation it follows that, while 
it may be conceived as an attribute of the 
client, resistance also depends upon the 
stimuli presented by the therapist to the 
client. It is commonly stated that the client 
or patient is basically responding in an in- 
fantile or childish fashion. This behavior 
is designated as transference, or is claimed to 
be based on transference, and is distinguished 
from resistance. It is, of course, obvious that 
the Freudian account of transference phe- 
nomena can be stated in terms of negative 
transfer, in which case the behavior of the 
therapist becomes of crucial importance— 
indeed, of rather more theoretical importance 
than is accorded in psychoanalytic notions of 
the psychotherapeutic process. Thus resist- 
ance and transference become aspects of the 
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general problem of the facilitation of the 
learning of new response organizations. One 
inference to be drawn is that the strength and 
direction, if not the amount, of transference 
behavior will be determined in part at least 
by the therapist. Transference and resistance 
behavior are, in terms of transfer of training, 
joint functions of the social learning of the 
client and the stimuli presented by the thera- 
pist to the client, and have a single basis. 

It is perhaps obvious by now that the 
word “role,” as used in this paper, refers 
to the roles the client thrusts or attempts to 
thrust upon the therapist—not the role that 
the therapist himself adopts. Insofar as the 
client is successful in evoking responses from 
the therapist, or insofar as the therapist be- 
haves spontaneously in accordance with these 
roles or expectations, then just so far will 
transference and resistance behavior increase. 
That is, the responses of the client will appear 
to the therapist to be unreasonable and will 
appear not be coordinate with the therapist's 
planned behavior. It is an almost universal 
experience, if the reports in literature are to 
be believed, that, if the therapist adopts a 
straightforward, logical approach to the 
client, the client will in turn argue, evade, 
voice suspicion of the motives and com- 
petence of the therapist, argue without 
accepting the full meaning of what has 
been said to him; in short he will ex- 
hibit behavior which is not in accordance 
with the realities of the situation which he 
is in. To say that his behavior is not co- 
ordinate with the behavior of the therapist is 
to say that old responses are being made to 
old stimuli or to stimuli similar to the old 
stimuli. This may also be stated as a problem 
of communication. What the therapist is 
expressing verbally or nonverbally does not 
have the same meaning and implication for 
the client that it has for the therapist. 

In this connection, it is perhaps worth 
while to examine a current statement of 
psychotherapeutic technique, that of Lo- 
rand (11). He implies that more often than 
not interpretations are given quite some time 
after the first contact. If he has been read 
correctly, the “transference” (unconscious in- 
fantile attitudes) during the relatively passive 
(on the part of the therapist) period is 
largely positive, and negative transference is 
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exhibited most markedly after the therapist 
has begun to make systematic interpretations 
of the client’s behavior and personality or- 
ganization. This is what might be expected 
on the basis of transfer of training. Surely 
interpretations of one’s own behavior are 
common enough experiences, and one has 
only to observe parents with children to 
realize that a certain resemblance to parental 
behavior becomes evident when the therapist 
begins to make his interpretations. The 
situation then becomes such that negative 
transfer is more likely to occur.” If one takes 
the position that systematic interpretation is 
a key technique of therapy, he can account 
for the difficulty with which personality re- 
organization (the association of new responses 
with old or similar stimuli) is achieved, and 
also for the length of the period required. 
On the one hand, the behavior of the thera- 
pist results in negative transfer (negative and 
positive countertransference behavior only 
accentuates this). On the other hand, certain 
other behaviors of the therapist, such as an 
uninvolved attitude, permissiveness, sympa- 
thetic understanding, etc., contribute to keep- 
ing the client in the situation so that ulti- 


mately the desived new learnings occur.’® 
This is not to say that the longer process is 
inefficient. It may be that it is necessary in 
some cases to “relive” the conflicts of every- 
day life in a setting such that the client 
necessarily comes to “own” his reactions, to 
perceive them in terms of himself rather 


than in terms of others. This, however, is 
a question for future research to decide. The 
question here is the contribution of the 
therapist himself to the phenomena of the 
therapeutic process as well as the contribu- 


® Interpretations are resisted. The focus of attention 
of the client shifts to defense; this preoccupation makes 
straightforward, reasonable communication impossible. 

101t is possible that in the controversies over deep 
versus superficial psychotherapy some of the actual re- 
ferents of the controversy lie in the phenomena of 
transfer. For example, the theoretical positions which 
eventuate in differential timing and type of interpreta- 
tion (Freudian versus Adlerian, say) may well result in 
evocation of transference and resistance reactions which 
follow distinctive lines of development. In such a 
case, new learnings may occur which result in desired 
reductions of tensions and the acquisition of anticipatory 
behaviors which tend to raise rather than lower self 
esteem. In the present state of knowledge, it is by 
no means certain that the same learnings must occur 
to produce satisfactory adjustment. It is the end 
product of the learnings that “pays off.” 
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tion of the “nature” (or of the personality 
organization) of the client. 

The notion that it is the characteristic of 
the human being that he in large part con- 
structs his own social reality by way of an- 
ticipatory behavior leads us to understand 
the role of transference and resistance in psy- 
chotherapy. They serve to interfere with 
communication from the counselor to the 
client. It is important to note here that 
present-day societies do not automatically 
provide corrections for trends of experience 
resulting from anticipatory behavior and re- 
actions thereto. This is an elemental condi- 
tion of social learning in a world of surplus. 
In rats and other animals without the rela- 
tively enormous generalizing powers of hu- 
man beings, the anticipations are intimately 
related to primary drives. If the anticipatory 
responses are consistently incorrect, the or- 
ganism dies. In man, incorrect trends of ex- 
perience generally have no such effect. In 
societies which can afford the luxury of 
therapists, the anticipatory forcing of re- 
sponses out of other persons in situations 
involving interaction does not result in loss 
of life, hunger, thirst, and similar decisive 
deprivations. In general, when penalties are 
invoked in human society, they are, by the 
very nature of social interactions, so am- 
biguous that a wide variety of responses may 
fit the situation so far as the individual is 
concerned. Thus, almost any “projection” 
may, in terms of behavior, be confirmed, 
since the responses of others will be to the 
actual “projective” behavior. The personal 
construction of a social reality of which the 
individual is himself a part depends upon 
high capacity to generalize; a capacity pos- 
sessed in sufficient degree only by man, since 
man is the only organism which is simul- 
taneously intellectual, instinct-free, and social. 
These considerations seem to lead inevitably 
to the conclusion stated at the beginning of 
this paragraph. They seem especially apropos 
when one considers that in psychotherapy we 
do not spend our time designating objects 
or, even in the most “directive” therapies, 
indulging extensively in ordering clients 
about. We deal with values, and communica- 
tion with respect to values is difficult in any 
context. 

The breakdown of communication with 
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respect to “transference” stems from the an- 
ticipations of the client based upon experience 
with authority figures and traces to the 
earliest social learnings. These anticipations 
are what we usuaily speak of as the role or 
roles projected on the therapist as an au- 
thority, an expert, one who knows more, who 
can punish or frustrate by breaking off the 
relationship and who can do so without hurt 
to himself. The interference of communica- 
tion with respect to “resistance” comes from 
maintaining defensive reactions related to 
the transference anticipations and also by 
any stimulus patterns presented by the thera- 
pist which confirm the anticipations. The 
resistance is not necessarily a desire to hang 
on to the neurosis. On the contrary, this is 
a mode of adjustment which the individual 
has learned; he knows no other. It is an 
attempt to maintain oneself against antici- 
pated attack, against attacks which have been 
made upon himself and by others. If the 
client is given the opportunity, he will more 
often than not express rather directly that 
the therapist is forebearing by reason of his 
profession rather than his person; or that 
the therapist is actually evaluating negatively 
(preparing to attack), that it is inconceivable 
that he will not be evaluated. Many clients 
will be quite devious and cunning in their 

pts to force evaluations and thus con- 

their own expectations. If the therapist 
demands either way, he will in- 


ets the 
bly get reactions from the client which 


classed as defensive or self-maintain- 
It will then be found that 


still try to manipulate the situation in 


the client 


shion to justify himself. If he is of a 
therapeutic persuasion which holds that the 
drives or goals of the client must be known to 
the therapist before new learnings can occur, 
the therapist must often infer what the client's 
symbol referents actually are. 

The discussion up to this point should 
emphasize that in the present state of knowl- 
edge one of the most pressing questions is: 
what are the stimulus conditions under which 
the client or patient can learn new attitudes 
which contribute to his psychological adjust- 
ment? In attempting to outline an answer 
to this question, it becomes noticeable that in 
discussions of therapeutic technique it has 


been emphasized from Janet and Dejerine 


Joun M. 


BuTLER 


on, that therapy should be conducted against 
a constant background of stimulation which 
is characterized by such words as tolerance, 
benevolence, warmth, permissiveness, “favor- 
able circumstances for emotional reactions.” 
These conditions of stimulation, all of which 
bear a certain resemblance and probably refer 
to much the same conditions, are seldom dis- 
cussed, strange to say, under the heading of 
technique. They are usually mentioned as 
indispensable prerequisites to psychotherapy. 
Perhaps it is because the necessity of such 
conditions is obvious. Nevertheless, it is 
desirable to point out here that the attitudes 
denoted by such terms must be expressed 
both verbally and nonverbally before the 
client can ascertain with the necessary degree 
of confidence that the therapist possesses 
them. It is here suggested that these condi- 
tions are not only prerequisites to psycho- 
therapy, to new learnings, but they are also 
primary stimulus conditions which bring 
about the required conditions for learning. 

Consider in this connection the social sit- 
uation in infancy and early childhood. Adults, 
as we know, have relatively unchanging per- 
sonality patterns. The child 
sidered as living in a relatively constant psy- 
chological environment because in the early 
years he is influenced almost solely by parents 
or caretakers comparison to the 
child, are not changing much. This con- 
stancy, however, is recurrent or cyclical rather 
than unchanging.’ The the 
child’s behavior are in turn, indifferent, re- 
jecting, approving, etc. The child 
tinually confronted with new situations and 
new developmental tasks. He is very often 
in the position of not being able to predict 

hat reactions he is going to receive to his 
own behavior and reactions. The very nature 
of his own anticipatory behavior does, how- 


ever, tend to bring consistency into his social 


can be con- 


who, in 


reactions to 


is con- 


environment because these anticipatory be- 
haviors evoke consistent responses from his 
parents. This consistency comes relatively 
late because of the slow early learnings of 
the child. That is, the recurring patterns of 
behavior presented by the parents to the 
child are perceived necessarily by the child 


of view of the child the parents 
unpredictable. To the ob- 


From the point 
seem inconsistent or 
however, the parents are 


may 
server, only too predictable. 
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as disconnected (at first at least) and are also 
transformed somewhat by the child himself; 
the child organizes both himself and his 
social environment. Now when the indi- 
vidual comes for psychotherapy, he enters 
a situation which is as absolutely constant as 
the therapist is able to make it with respect 
to the attitude of the therapist toward the 
client. Regardless of other behaviors of the 
therapist, he does not (or tries not) to react 
to the client in terms of disapproval, rejec- 
tion, and so on; in short, he does not evaluate 
the client negatively. This consistency of 
expressed attitudes, which is most often ex- 
pressed nonverbally, corresponds to an invi- 
tation to the client to produce certain stimuli, 
verbal or nonverbal, to which he himself 
will react and which he is probably more 
often than not unable to produce elsewhere. 
The behavior of the therapist at the point of 
production of these stimuli then becomes of 
paramount importance.” If the therapist is 
constantly expressing the attitudes described 
above, he is not, in the nature of the case, 
taking over the roles assigned to him by the 
client; he is not meeting the client’s antici- 
pations. Therefore, the constant expression 
of these attitudes constitutes a direct denial 


of the client’s previous experience and as 
such has its effect in making a situation 
favorable for new learnings with respect to 


both stimulus and response. 

The expression of these “encouraging” atti- 
tudes by the therapist is, it is believed, com- 
mon to almost all therapists, eclectic as well 
as systematic in their orientations. What then 
accounts for the differences in the length of 
psychotherapy, for differences in the kinds 
of verbal behavior elicited from clients, which 
seem to be characteristic of the different 
orientations? If the individuals who choose 
therapists of differing orientations are not 
basically different, it seems reasonable to 
suppose that the characteristic differences in 
length of therapy and in what is said by 
clients in, say, psychoanalytic therapy and in 
client-centered therapy are occasioned by dif- 
ferent processes set in motion by the differ- 

12 This is especially true when the planned therapy 
is based on repeated verbal corrections, commands, and 
rohibitions to meet the constructual needs of the client. 
The warmth and apparent sincerity of the therapist's 
nonverbal acceptance is, of course, decisive in holding 
the client in therapy. 
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ential behavior of the therapists. For instance, 
Lorand, a training analyst, divides temporal 
sequences in psychoanalysis (in relation to 
technique) into three phases. In the first 
phase, primarily through the observation of 
transference manifestations, the patient is 
encouraged to express his thoughts and feel- 
ings in order to give the therapist a picture 
of his life history and of the development of 
his behavior patterns. Observation, then, is 
the main job of the analyst in the first phasé. 
As the patient realizes that he is not being 
criticized, he loses some of his fear and guilt 
and “spontaneous” improvement occurs. The 
analyst leaves transference out of the dis- 
cussion as much as possible. 

In the second phase resistance develops as 
the patient becomes aware of his problems. 
The transference relationship becomes promi- 
nent and interpretation of the transference 
becomes important. On becoming the center 
of the patient’s desires, the analyst also be- 
comes a focal point of his aggression and ;re- 
sistance then moves into the foreground. The 
analyst must continually review the patient’s 
behavior and point out the connection be- 
tween the patient’s present and past attitudes. 
This activity on the part of the therapist, 
compared to his relative inactivity in the 
first phase, is necessitated by the flowering 
of the transference and by the patient's in- 
creasing aggression and resistance. 

The third phase is, apparently, known by 
its fruits. The transference gradually weakens 
as a result of the activity of the therapist in 
overcoming or breaking down resistance 
and in alleviating anxiety. The patient be- 
comes more self-reliant and learns to do 
things by himself. 

The process is summarized as follows: By 
reliving past experiences through transference 
and under the favorable emotional circum- 
stances of psychotherapy and by having the 
benefit of the interpretations of the analyst, 
the patient has his problems clarified, comes 
to understand projections, repressions, and 
unconscious fears. Of course, the therapist 
must not make the transference situation a 
reality. 

It is believed that this process described 
by Lorand can be derived from the kind 
of results obtained in transfer-of-training ex- 
periments. The patient comes in for therapy 
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with anticipations or sets resulting from his 


important social interactions. He is met by 


a therapist whose behavior is permissive and 


accepting. This behavior corresponds to a 
“correct” response from the experimenter 
and encourages the patient to continue the 
kind of responses which he has been making. 
At the same time, however, there is inhibition 
because the thoughts and feelings being ex- 
pressed have not hitherto been encouraged. 
With the continued expression of permissive- 
ness by the therapist, the inhibitions lessen. 
As they lessen and as the patient continues 
to express himself he learns more and more 
that his thoughts and feelings are not wrong 
and “spontaneous improvement” occurs; i.¢., 
he learns more about himself and can reason 
with himself. 

The continuing self-revelations, however, 
lead more and more to the expression of 
self-forbidden thoughts and feelings that 
have brought rejection from self and pre- 
sumably from others. Thus, self-expression 
leads to conflicting anticipations and to the 
development of “resistance” or lack of pro- 
gress. At the time when the conflict between 
anticipations of rejection and of acceptance ** 
arises, the therapist changes his behavior and 
begins his interpretations. This should be 
a crucial point in the development of the 
therapy, because just at the point where the 
patient is anticipating evaluation and rejec- 
tion as well as nonevaluation the therapist 
begins to produce stimuli resembling some of 
those connected with the patient’s malad- 
justment. The old experience of the patient 
is reinforced or confirmed; he learns again 
what he already knows about himself. The 
experimenter has said “wrong.” 

It must be emphasized that the behavior 
of the therapist need not be the same, say, 
as that of a parent. It need only be similar 
in the ways earlier described. Of course, 
the balance between confirming the client's 
earlier learnings through interpretation and 
denying them by acceptance and permissive- 

18 This 


It seems 


situation is probably somewhat complicated. 
two sources of rejection and one of ac- 
ceptance w be anticipated; the patient can reject 
himself and the therapist can reject him, whereas 
only the therapist can accept him. As the self- 
revelations become more disturbing the possible double 
source of rejection perhaps begins to outweigh the 
possible single source of acceptance. 


that 
uld 
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ness must be kept favorable so the patient’s 
anxiety will not impel him to leave the situa- 
tion. Thus, the continued emphasis on the 
dosage and alleviation of anxiety in the 
literature on technique. It is this difference 
in the behavior of the therapist which gives 
the therapeutic atmosphere its distinctiveness. 
It is also this difference which finally allows 
the patient to “accept” the interpretation of 
the therapist. To accept the interpretation is 
to be in communication with the therapist, 
because the patient can accept it only if it 
has the same meaning for him that it has 
for the therapist. This is not to say that 
incorrect interpretations are accepted. The 
criterion of the correctness of an interpreta- 
tion is, in the understanding of the writer, 
the patient’s reaction, whether rejecting or 
accepting. If the patient accepts it totally, 
it is beld to be correct. If not, the patient 
is either right or he is resisting and further 
investigation or further progress with re- 
sultant “spontaneous” acceptance then pro- 
vides the answer. Thus, one of the meanings 
of the growth of “self-reliance” of a patient 
in analysis seems to be his growing ability 
to make valid discriminations between correct 
and incorrect interpretations. He can re- 
ceive an interpretation without perceiving and 
responding in old ways. The consequence of 
this is that the patient can understand the 
referents of the symbols employed by the 
therapist and then can decide whether the 
symbols actually refer to factors in himself. 

The psychoanalysts and the psychoana- 
lytically oriented are not inclined to interpret 
the behavior of the therapist in terms of 
transfer of training effects. For instance, 
one writer (18) takes the view that psycho- 
therapy necessarily involves a social conflict 
in which symptoms (means of dealing with 
inner conflict) are converted into defenses 
(means of resisting the therapist). This con- 
flict has one of two results: the client termi- 
nates the therapeutic relationship or he enters 
into a relational struggle with the therapist 
in which the therapist takes parental roles 
and educates the client to maturity. It is 
noticeable that in this formulation and in 
others with a similar orientation it is implied 
that the therapist accepts the gauge of battle 
and he does so, not in his own interests or 
in terms of his own needs, but in terms of 
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the needs of his client. He is working for 
the client. A paradox is created, neverthe- 
less, because the challenge accepted is to an 
interpersonal struggle. The ambivalence of 
this situation can exist only because it is held 
that the “transference” and “resistance” be- 
havior originates in client-produced stimuli. 
It follows then that their emergence in the 
form of a relational struggle becomes in- 
evitable. Placing the locus of transference 
and resistance in the client has the conse- 
quence that the therapist himself tends to 
present the client with certain stimuli rather 
than others. It then follows, in terms of our 
interpretation of the nature of social inter- 
action, that the typical phenomena described 
by psychoanalytic therapists eventuate. The 
expectations of the therapists are thereby con- 
firmed by their own observations; antici- 
patory behavior produces a social reality for 
the therapist.* In view of the similarity 
from case to case with respect to transference 
and resistance behavior, it quite naturally 
follows that it “is through the handling or 
‘working through’ of the transference that 
permanent immunity to the neurosis is 
achieved, if at all” (18). Again, in our in- 


terpretation, this is a statement that the un- 


reasonable behavior of the client must be 
worked with since it is partly evoked by the 
behavior of the therapist. 

Generally, the sequence of behavior in 
client-centered therapy is quite different from 
the psychoanalytic sequence (20). It is the 
experience of the writer that “transference” 
and “resistance” reactions often appear early 
in client-centered therapy in much the same 
fashion as is implied by Lorand in his de- 
scription of the first phase of psychoanalytic 
therapy. From there on the development of 
such behaviors is different. The reactions 
tend to disappear although they also tend to 
reappear as new problem areas are uncovered 
and discussed. Also, the writer has some- 
times ventured to interpolate interpretations 
at about mid-course; the reactions have almost 

14 This suggests that even though the therapist's ob- 
servations of the client were accurate (we know that 
they are not), they are not enough. In order to depict 
the situation accurately, the therapist must know what 
stimulus value he has for the client at the moment 
of observation. The more variable the behavior of 
the therapist, the more difficult it is for him to under- 


stand his own role in evoking responses from the 
} 
chent. 
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universally been extreme and would, I think, 
be described as “transference” and “resistance” 
reactions. Interpretations near the end of 
therapy tend to be received calmly and reason- 
ably and accepted or rejected on what the 
writer conceives to be a reality basis. 

It is here hypothesized that initial “trans- 
ference” and “resistance” reactions die out 
because the therapist's behavior is consistently 
accepting and permissive and is directed 
toward understanding the frame of reference 
of the client as expressed verbally and non- 
verbally. Stated in terms of communication, 
this means that the signs and symbols (es- 
pecially the valuative signs) have much the 
same meanings for both client and therapist; 
therefore, the client and the therapist are 
communicating. In terms of transfer of 
training, it is believed that the effect of the 
particular kind of communication stressed by 
many client-centered therapists and the ex- 
pressed attitudes of acceptance (and kindred 
attitudes) serve to deny and extinguish antici- 
pations of the client with regard to the thera- 
pist. Thus, although “transference” is always 
incipient, it does not develop into a “trans- 
ference relationship” because the behavior of 
the therapist progressively extinguishes the 
reactions. The often violent appearance of 
“transference” and “resistance” reactions 
when the therapist changes behavior before 
the end stages of therapy indicates perhaps 
that such anticipations exist. Also the non- 
appearance of such reactions in the last phases 
may indicate that generalization has taken 
place to the extent that not all (and perhaps 
few) of the important emotional experiences 
of the client need be “relived” or “worked 
through” explicitly with the therapist. 

1 of SUMMARY 

In thent aper the behavior of the psycho- 
therapist has been considered in terms of a 
central hypothesis of social interaction: In 
two-way personal-social interaction each indi- 
vidual concerned displays anticipatory be- 
havior which in turn elicits anticipatory 
behavior on the part of the other, the antici- 
patory behavior of the other in each case 
confirming the primary learnings of the 
individual about himself. 

In considering the hypothesis in relation 
to client-therapist interaction it was hypothe- 
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begins to produce stimuli resembling some of 
those connected with the patient’s malad- 
justment. The old experience of the patient 
is reinforced or confirmed; he learns again 
what he already knows about himself. The 
experimenter has said “wrong.” 

It must be emphasized that the behavior 
of the therapist need not be the same, say, 
as that of a parent. It need only be similar 
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earlier learnings through interpretation and 
denying them by acceptance and permissive- 
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ness must be kept favorable so the patient's 
anxiety will not impel him to leave the situa- 
tion. Thus, the continued emphasis on the 
dosage and alleviation of anxiety in the 
literature on technique. It is this difference 
in the behavior of the therapist which gives 
the therapeutic atmosphere its distinctiveness. 
It is also this difference which finally allows 
the patient to “accept” the interpretation of 
the therapist. To accept the interpretation is 
to be in communication with the therapist, 
because the patient can accept it only if it 
has the same meaning for him that it has 
for the therapist. This is not to say that 
incorrect interpretations are accepted. The 
criterion of the correctness of an interpreta- 
tion is, in the understanding of the writer, 
the patient’s reaction, whether rejecting or 
accepting. If the patient accepts it totally, 
it is held to be correct. If not, the patient 
is either right or he is resisting and further 
investigation or further progress with re- 
sultant “spontaneous” acceptance then pro- 
vides the answer. Thus, one of the meanings 
of the growth of “self-reliance” of a patient 
in analysis seems to be his growing ability 
to make valid discriminations between correct 
and incorrect interpretations. He can re- 
ceive an interpretation without perceiving and 
responding in old ways. The consequence of 
this is that the patient can understand the 
referents of the symbols employed by the 
therapist and then can decide whether the 
symbols actually refer to factors in himself. 

The psychoanalysts and the psychoana- 
lytically oriented are not inclined to interpret 
the behavior of the therapist in terms of 
transfer of training effects. For instance, 
one writer (18) takes the view that psycho- 
therapy necessarily involves a social conflict 
in which symptoms (means of dealing with 
inner conflict) are converted into defenses 
(means of resisting the therapist). This con- 
flict has one of two results: the client termi- 
nates the therapeutic relationship or he enters 
into a relational struggle with the therapist 
in which the therapist takes parental roles 
and educates the client to maturity. It is 
noticeable that in this formulation and in 
others with a similar orientation it is implied 
that the therapist accepts the gauge of battle 
and he does so, not in his own interests or 
in terms of his own needs, but in terms of 
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the needs of his client. He is working for 
the client. A paradox is created, neverthe- 
less, because the challenge accepted is to an 
interpersonal struggle. The ambivalence of 
this situation can exist only because it is held 
that the “transference” and “resistance” be- 
havior originates in client-produced stimuli. 
It follows then that their emergence in the 
form of a relational struggle becomes in- 
evitable. Placing the locus of transference 
and resistance in the client has the conse- 
quence that the therapist himself tends to 
present the client with certain stimuli rather 
than others. It then follows, in terms of our 
interpretation of the nature of social inter- 
action, that the typical phenomena described 
by psychoanalytic therapists eventuate. The 
expectations of the therapists are thereby con- 
firmed by their own observations; antict- 
patory behavior produces a social reality for 
the therapist.* In view of the similarity 
from case to case with respect to transference 
and resistance behavior, it quite naturally 
follows that it “is through the handling or 
‘working through’ of the transference that 
permanent immunity to the neurosis is 


achieved, if at all” (18). Again, in our in- 


terpretation, this is a statement that the un- 
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worked with since it is partly evoked by the 
behavior of the therapist. 

Generally, the sequence of behavior in 
client-centered therapy is quite different from 
the psychoanalytic sequence (20). It is the 
experience of the writer that “transference” 
and “resistance” reactions often appear early 
in client-centered therapy in much the same 
fashion as is implied by Lorand in his de- 
scription of the first phase of psychoanalytic 
therapy. From there on the development of 
such behaviors is different. The reactions 
tend to disappear although they also tend to 
reappear as new problem areas are uncovered 
and discussed. Also, the writer has some- 
times ventured to interpolate interpretations 
at about mid-course; the reactions have almost 

14 This suggests that even though the therapist’s ob- 
servations of the client were accurate (we know that 
they are not), they are not enough. In order to depict 
the situation accurately, the therapist must know what 
stimulus value he has for the client at the moment 
of observation. The more variable the behavior of 
the therapist, the more difficult it is for him to under- 
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universally been extreme and would, I think, 
be described as “transference” and “resistance” 
reactions. Interpretations near the end of 
therapy tend to be received calmly and reason- 
ably and accepted or rejected on what the 
writer conceives to be a reality basis. 

It is here hypothesized that initial “trans- 
ference” and “resistance” reactions die out 
because the therapist’s behavior is consistently 
accepting and permissive and is directed 
toward understanding the frame of reference 
of the client as expressed verbally and non- 
verbally. Stated in terms of communication, 
this means that the signs and symbols (es- 
pecially the valuative signs) have much the 
same meanings for both client and therapist; 
therefore, the client and the therapist are 
communicating. In terms of transfer of 
training, it is believed that the effect of the 
particular kind of communication stressed by 
many client-centered therapists and the ex- 
pressed attitudes of acceptance (and kindred 
attitudes) serve to deny and extinguish antici- 
pations of the client with regard to the thera- 
pist. Thus, although “transference” is always 
incipient, it does not develop into a “trans- 
ference relationship” because the behavior of 
the therapist progressively extinguishes the 
reactions. The often violent appearance of 
“transference” and “resistance” reactions 
when the therapist changes behavior before 
the end stages of therapy indicates perhaps 
that such anticipations exist. Also the non- 
appearance of such reactions in the last phases 
may indicate that generalization has taken 
place to the extent that not all (and perhaps 
few) of the important emotional experiences 
of the client need be “relived” or “worked 
through” explicitly with the therapist. 


SUMMARY 

In t per the behavior of the psycho- 
therapist ‘has been considered in terms of a 
central hypothesis of social interaction: In 
two-way personal-social interaction each indi- 
vidual concerned displays anticipatory be- 
havior which in turn elicits anticipatory 
behavior on the part of the other, the antici- 
patory behavior of the other in each case 
confirming the primary learnings of the 
individual about himself. 

In considering the hypothesis in relation 
to client-therapist interaction it was hypothe- 
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sized that the therapist’s behavior is so in- 


ul 


nately interconnected with the behavior of 


the client that exact observation of the thera- 
pist is a necessary precondition to under- 


Sti 


nding the behavior of the client. Con- 


sideration of the behavior of the therapist in 
relation to the hypothesis, to conditions, and 


to characteristics of human learning, social 
learning, and personality development led 
to the development of hypotheses concerning 
the role of resistance, transference, and inter- 
pretation in psychotherapy, to the necessity 


of 


ce 


“reliving the past,” and to hypotheses con- 
rning descriptions of temporal sequences 


of behavior in psychoanalysis and in client- 


ce 


ntered counseling. 
The basic viewpoint taken was that the 


alleged phenomena of psychotherapy can be 
considered fruitfully in terms of phenomena 


of 


learning as well as in terms of learning 


theory. 
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ENVIRONMENTAL DETERMINANTS OF BASIC DIFFICULTY 
PROBLEMS * 


A. J. DRUCKER ano H. H. REMMERS 


Purdue University 


NUMBER of studies dealing with en- 

vironment and behavior have revealed 

significant relationships between home 
factors and children’s attitudes. Mothers’ 
attitudes have been shown to correlate 
highv with children’s attitudes (7). Mothers’ 
education level has been found to be highly 
related to attitude of the child toward how 
to bring up a child (4). Citizenship attitudes 
of high school pupils have been found to be 
related to such environmental factors as 
socioeconomic status, level and type of educa- 
tion of the fathers, and extent of rurifica- 
tion of the home community (2). Using the 
SRA Youth Inventory and other instruments, 
Auld has recently demonstrated that there 
are differences between people from different 
social classes in the way they respond to per- 
sonality tests, corroborating the above stud- 
ies. Further, he cautions about differences 
as “trivial” or “artifactual” rather than as 
meaningful ones and urges that any class dif- 
ferences be taken into account in research and 
in clinical practice (1). 

The criterion variable of this study, de- 
signed to explore environmental determinants 
or correlates of adjustment through analyzing 
attitude responses of children, is the num- 
ber of a total of 101 problem recognition 
items checked by rooo 7th and 8th graders 
on the SRA Youth Inventory. These items 
were originally earmarked by a group of clin- 
icians as being particularly indicative of po- 
tential maladjustment and are called Basic 
Difficulty items. The Youth Inventory (5), 
incidentally, was standardized on the basis 
of responses of several thousand high school 
students thereby furnishing separate norms 
for combinations of sex, grade, and rural- 
urban status but was adapted for use with 
sth and 8th graders when little or no striking 
response differences were experimentally 
noted (6). Recent validation of the Basic Dif- 
ficulty score and the other eight part scores 


1Presented at the 23rd annual meeting of the Mid- 


western Psychological Association. 
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made by 7th and 8th graders has been under- 
taken with counselor ratings of pupil adjust- 
ment as the criteria. Largest differences be- 
tween criterion groups were obtained for the 
Basic Difficulty scores and for part scores 
titled “About Myself” and “My School” (3). 

The distribution of the total group of 1000 
7th and 8th graders was positively skewed, 
suggesting that most subjects have checked 
few of the Basic Difficulty items and only a 
few have checked a large proportion of them. 
The mean for the total group was 16.7 with 
a standard deviation of 4.4, but a range of 

I. 

In Table 1 several environmental factors 
show up in relationship to the Basic Diffi- 
culty scores when single factors are consid- 
ered one at a time. Significantly more Basic 
Difficulty problems are checked by pupils 
rated low in economic status (on the basis of 
a six-item House and Home scale), by Cath- 
olic pupils, by pupils whose mothers did not 
finish high school and by pupils whose fath- 
ers did not finish high school. Fewer Basic 
Difficulty problems are checked by high eco- 
nomic status pupils, by Protestant pupils 
(versus Catholic), by pupils whose mothers 
finished high school or attended college and 
by pupils whose fathers finished high school 
or attended college. 

If the clinician or researcher is to take into 
account environmental factors in adjustment, 
he must know more about them in relation 
to each other as well as in relation to the 
adjustment measure. The next step was to 
study several of these environmental factors 
in combination. Perhaps economic status, 
educational level of the parents, and other 
factors would be shown to pick out largely 
the same pupils. Differentiating pupil re- 
sponse of the SRA Youth Inventory accord- 
ing to economic status and educational level 
of the parents, along with sex and grade as 
additional controls, this was not found to be 
true. Education of the mothers and economic 
status, as measured, both accounted about 
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TABLE | 


Basic Duirricurty 


Total 


Low income 
High income 


Protestant 
Catholic 


Fathers’ education 
Grade only 
High school graduate 
Attended college 


schvol 


Mothers’ education 
Grade only 
High school graduate 
Attended college 


school 


equally and independently for some of the 
variation in Basic Difficulty scores. The 
analysis of variance is summarized in Table 
2. Mean scores of low and high economic 
status pupils were 19.0 and 14.6 and the mean 
scores of pupils with low and high mothers’ 
education were 19.4 and 14.2, respectively. 
Religion as an environmental factor was ex- 
amined for its presumed effect upon the cri- 
terion attitudes with sex controlled. Here it 
was found that Catholic boys checked more 
problems (21.1 on the average) than Prot- 
estant boys or girls or Catholic girls, who 
checked on the average 15.1, 16.7, and 16.5, 
respectively (see Table 3). 

It appears evident that environmental fac- 
tors such as income level of the home plus the 


TABLE 2 


ANALYsis OF VARIANCE oF 400 JuNIOR Hicu ScHoo. 
Pupits’ Scores ON THE Basic Dtrricutty 
Component oF SRA Youtu INVENTORY 


MEAN 
SQUARE 


SouRCE OF SUM OF 


VARIATION SQUARES 





69. 69.7 


Grade 
Income level 
Mothers’ 

education I 


1879 1879. 

2719.622 2719 

1717. 1717. 

67223.92 175.0 
0775 


Interactions 11 
Within groups 384 
Total 399 


7361 





® Significant beyond the 1 per cent level. 


ScoRES BY 


ENVIRONMENTAL GROUPS 


MEAN STANDARD DEVIATION STUDENT ?’'s 
16.7 
18.1 
13.2 


16.1 
18. 


18. 
13. 
13. 


19. 
14.§ 


home’s educational or intellectual level are 
positively correlated with good adjustment, 
defined as the number of Basic Difficulty 
problems checked on the SRA Youth In- 
ventory. Are these “artifactual” differences? 
Other things being equal, satisfaction of basic 
economic needs in a household might be ex- 
pected to lead to fewer wants and fewer 
problems. Also high education of the parents 
should mean, by and large, a more intelli- 
gent or rational approach to family crises in- 
volving the children than would be obtained 
in a family where ignorance and supersti- 
tion prevail. As for our evidence indicating 
adjustment problems on the part of Cath- 
olic boys more than on the part of other 
children, further research on this problem, 


TABLE 3 


NonorTHOGONAL ANALYsIS OF VARIANCE OF 851 
Juntor HicH Scuoot Pupits’ Scores ON THE 
Basic Dirricutty Component oF SRA 
Youtu INVENTORY 





SuM OF 
SQUARES 


SouRCE OF 


VARIATION aj SQUARE 





Sex 20.5882 20.5882 12 
Religion (Prot. 
vs. Cath.) 
Sex X Religion 

interaction I 955.0552 
Within groups 847 151578.2730 
Total 850 153413.2599 


859.3435 859.3435 4.802° 


955.0552 


5-337°° 
178.9590 —— 





* Significant at 4 per cent level. 
** Significant at 2% per cent level. 
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possibly involving intensity measures of re- 
ligious feeling, is called for. 

Of course only a relatively small portion 
of the variation in these scores has been ac- 
counted for by environmental factors. There 
are other important correlates of adjustment 
untapped here, obviously, just as this was sus- 
pected in other studies where determinants 
of attitudes were sought (2, 4). High income 
groups and groups whose parents have had 
college education have consistently given the 
more culturally and socially approved attitu- 
dinal and information responses (whenever 
outside criteria existed) on such matters as 
attitudes toward citizenship, attitudes toward 
child-rearing, general knowledge, knowledge 
of current affairs, and amount of supersti- 
tion, to mention only a few, than low income 
groups and groups whose parents did not 
complete high school. More basic measures 
—one tapping intellectual level certainly— 
are needed. 

Instruments like the Youth Inventory are 
developed as tools for school administrators, 
teachers, and counselors. Despite the fact 
that environmental factors do not tell the 
whole story of adjustment, in the school sit- 
uation information like the foregoing can 
sometimes be of great importance to edu- 
cators and counselors. Administratively it is 
relatively easy to determine sex, grade, and 
even intelligence level. Factors of the en- 
vironment in many cases may be as easily, 
if roughly, measured in the classroom situa- 
tion. This should be done if it can be dem- 
onstrated, as in this study, that differences 
among environmentally differentiated groups 
are large. Additional normative data based 
on such factors as social and economic status, 
intellectual and emotional levels of the home, 
and other factors of the environment may 
then be profitably used in the evaluation of 
personal problems. 


SUMMARY AND CONCLUSIONS 


The SRA Youth Inventory was adminis- 
tered to 1000 7th and 8th graders throughout 


the country. Significantly more Basic Difh- 
culty probems were checked by the following 
groups of pupils, environmentally differen- 
tiated: 


Those of low economic status. 
Those whose mothers did not finish 
high school. 
Those whose fathers did not finish 
high school. 

4- Catholic pupils. 


There is additional evidence that level of 
education of the mothers and economic status 
are mutually independent as determinants 
of Basic Difficulty problems. 

Catholic boys aré found to check more 
Basic Difficulty problems than Catholic and 
Protestant girls and Protestant boys. 

It is recommended both that additional 
normative data be furnished for instruments 
like the Youth Inventory and that its users 
endeavor to obtain information concerning 
the environmental background of subjects for 
optimal use of personality inventories. 
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NYONE who observes persons in groups 
and the same persons individually 
is forced to conclude that they often 

behave differently in these two general kinds 
of situations. Casual observation would 
seem to indicate that one kind of behavior 
difference stems from the fact that people ob- 
tain release in groups, that is, are sometimes 
more free from restraints, less inhibited, and 
able to indulge in forms of behavior in which, 
when alone, they would not indulge. 

The most often noted instance of such 
freedom from restraint is the behavior of 
persons in crowds. In a crowd, persons will 
frequently do things which they would not 
allow themselves to do under other circum- 
In fact, they may even feel very 
much ashamed later on. Such behavior is 
not, however, limited to crowds. It occurs 
regularly in groups of all sizes and of many 
different types. For example, a group of 
boys walking down the street will often be 
wilder and less restrained than any of them 
individually would be; at an evening party 
persons who are usually very self-conscious 
and formal will sometimes behave quite 
freely; the delegates to an American Legion 
convention, all dressed in the same uniform 
manner, will sometimes exhibit an almost 
alarming lack of restraint. The question 
with which we will concern ourselves is: 
when does this kind of behavior occur and 


stances. 


why does it occur? 

There occurs sometimes in groups a state 
of affairs in which the individuals act as if 
they were “submerged in the group.” Such 
a state of affairs may be described as one of 
de-individuation; that is, individuals are not 


seen or paid attention to as individuals. The 
members do not feel that they stand out as 
individuals. Others are not singling a person 


1 This study was done at the Research Center for 
Group Dynamics at the University of Michigan. 


out for attention nor is the person singling 
out others. 

We would like to advance the theory that, 
under conditions where the member is not 
individuated in the group, there is likely to 
occur for the member a reduction of inner 
restraints against doing various things. In 
other words, many of the behaviors which 
the individual wants to perform but which 
are otherwise impossible to do because of 
the existence, within himself, of restraints, 
become possible under conditions of de-indi- 
viduation in a group. 

If individuals, then, have needs which they 
are generally unable to satisfy because of the 
existence of inner restraints against doing 
certain things, a state of de-individuation in 
a group makes it possible for them to obtain 
satisfaction of these needs. A group situa- 
tion where de-individuation does occur will 
consequently be more satisfying, other things 
being equal, than one where de-individua- 
tion never takes place. We would expect 
groups which do occasionally provide condi- 
tions of de-individuation to be more attrac- 
tive to their members. 

The satisfaction obtained during states of 
de-individuation is only one of many kinds 
of satisfactions which persons obtain in 
groups. Groups help people achieve goals 
which require joint or cooperative action, they 
provide support for opinions and behavior 
patterns, they sometimes satisfy persons’ 
needs for approval and status, and the like. 
Many kinds of satisfactions which groups 
provide and which, consequently, make 
groups attractive to members may be put 
into two incompatible classes: 

1. Those which necessitate individuation 
in the group. Prestige and status in a group, 
for example, require singling out an individ- 
ual and behaving toward him in a special 
manner. Helping members achieve certain 
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of their goals requires paying attention to the 
individual and to his particular needs. 

2. Those which necessitate deindividuation 
in the group. These are the satisfactions 
which result from the lessening of inner re- 
straints which we have discussed above. 

It is clear that these two classes are incom- 
patible in the sense that groups cannot pro- 
vide both individuation and de-individuation 
at the same time. Groups can, however, pro- 
vide both on different occasions. 

Groups which can provide only states of 
de-individuation are probably not very stable. 
Crowds are a good example of this kind of 
group. The momentary and _ evanescent 
existence of crowds is probably due to the 
inability of this type of group to satisfy needs 
requiring individuation. On the other hand, 
groups which can provide only conditions 
of individuation are probably not very satis- 
fying to their members. A_ group, for 
example, in which members were constantly 
being singled out for praise, approval and 
attention would most likely prove frustrating 
in the long run. Groups which succeed in 
being very attractive to their members prob- 
ably provide both types of situations on dif- 
ferent occasions. 

As a beginning toward support of this 
theory concerning the consequences of de- 
individuation we set out, in the present 
study, to demonstrate: 

1. That the phenomenon of de-individu- 
ation in the group occurs and is accompanied 
by a reduction in inner restraint for the 
members. 

2. That groups in which inner restraints 
are reduced are more attractive to their mem- 
bers than groups in which this does not 
occur. 


The attempt was made, in a laboratory 
situation, to provide conditions which would 
facilitate de-individuation in the group and 
would also provide adequate opportunities 


for measurement. To do this, we wanted to 
create a situation in which there would be a 
strong force acting on the members to engage 
in some behavior against which there were 
strong inner restraints. Under such condi- 
tions some groups would be better able to 
create de-individuation situations than others. 
If de-individuation in the group did occur it 
would seem, from our theory regarding the 
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phenomenon, that during such periods of 
de-individuation individuals in the group 
would not be paying particular attention to 
other individuals gua individuals. If this 
were true then, while being attentive to, and 
consequently well able to remember, what 
was done in the group, they should be less 
attentive to and less well able to remember 
which particular member had done what. 

The extent to which inner restraints 
against engaging in the particular behavior 
were reduced should be reflected in the extent 
to which the members showed the behavior 
in question. This measure would undoubt- 
edly be subject to error because of variation 
from group to group in the strength of the 
force acting on the members to engage in the 
behavior. If, however, we find a positive 
correlation between the extent to which the 
behavior in question was produced and the 
extent to which they were unable to identify 
who did what, this would be evidence sup- 
porting our theory of de-individuation in the 
group. 

In those groups in which the restraints 
against engaging in the particular behavior 
were reduced the members would have ob- 
tained more satisfaction from the group situ- 
ation. From our theoretical considerations 
we would consequently expect that the 
groups which did provide the conditions for 
de-individuation would be more attractive to 
their members. 


PROCEDURE 


The subjects (Ss) for the study were males who 
volunteered in various undergraduate classes at the 
University of Michigan to participate in a group 
experiment. Seven volunteers were scheduled for 
each session, but for various reasons (study pressures, 
forgetfulness, etc.) all seven rarely appeared for the 
discussion meeting. Our sample consists of 23 
groups, ranging in size from 4 to 7.2 

When they arrived at the discussion room, Ss were 
seated around a conference table and were engaged 
by the observer in mildly cheerful small talk. This 
procedure was adopted to prevent excessive predis- 
cussion interaction among Ss which we felt might 
introduce additional factors. 

When all Ss had arrived, the experimenter (E) 
directed them to print their first names on cards 
so that each could be identified by the others in the 


2 Fight female discussion groups were also conducted. 
These are not included with our experimental sample of 
male groups because of their considerably poorer memory 
with respect to who said what during the discussion. 
The results for these female groups, however, are in the 
same direction as those herein reported for the males. 
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discussion, and then proceeded to read aloud the 
following statement. The alleged survey and its 
findings are, needless to say, entirely fictitious. 

“The following statement represents a summary 
of an important research project that has recently 
come to the attention of psychiatrists and social 
scientists concerned with problems of personal ad- 
justment among students. Although the results are 
demonstrably reliable, it is believed that additional 
implications can be brought to light by having small 
groups of students discuss their personal views 
relating to these results. 


“A highly representative sample of 2365 students 
(1133 female and 1232 male) on 14 campuses, from 
all social-economic classes and several nationality 
backgrounds, was subjected to an intensive three- 
week psychiatric analysis consisting of repeated 
depth interviews and a battery of sensitive diagnos- 
tic tests. The results show unequivocally that 87 
per cent of the sample possessed a strong, deep- 
seated hatred of one or both parents, ranging from 
generalized feelings of hostility to consistent fantasies 
of violence and murder. A finding of further sig- 
nificance was that those individuals who at first 
vehemently denied having such hostile impulses or 
who were unwilling to discuss their personal feel- 
ings in the matter were subsequently diagnosed 
as possessing the most violent forms of hostility. In 
other words, conscious denial, silence, or embarrass 
ment were found to be almost sure signs of the 
strongest kind of hatred. Of the 13 per cent in whom 
no trace of hostility was found, the great majority 
thought they probably hated their parents and were 
willing to discuss every aspect of their feelings with 
the investigator. 

“In summary, 87 per cent were found by modern 
psychiatric techniques to possess deep-seated resent- 
ments and hostilities toward one or both parents. 
Individuals in this category who most vigorously de 
nied that they had such feelings revealed, at the con 
clusion of analysis, the strongest degree of hatred. 
Thirteen per cent were found to be free of such ag 
gressive impulses. Most of these individuals at first 
thought they were basically hostile and were inter- 
ested in discussing their feelings toward their par- 
ents freely. 

“Discuss in detail your own personal feelings 
toward your parents in the light of these results. 
Try to analyze yourself in such a way as to get at 
the basic factors involved.” 

The Ss were each given a copy of the above state 
ment and were asked to start discussing the matter. 
The discussion lasted 40 minutes 

The discussion material was designed to create 
conditions in which the phenomenon of de-individ 
uation might occur. The particular topic was 
chosen because it was felt that most people would 
have inner restraints against expressing hatred of 
their parents and, in fact, many would not even 
want to admit it to themselves. In preliminary 
experiments, the statement given to Ss did not 
include the part which indicated that those who 
initially denied it later turned out to be the ones 
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with the strongest hatred toward their parents. In 
these preliminary experiments the most frequent 
occurrence was complete avoidance of the topic they 
had been asked to discuss. Including this state- 
ment provided a force on Ss to talk about it. In 
other words, to the degree that Ss accept the state- 
ment, they experience a more or less strong pressure 
to reveal negative feelings toward their parents. 
This, together with the inner restraints against say- 
ing such things, provided the conditions that we 
wished to create. 


Observation Methods 


During the 40-minute discussion an observer cate- 
gorized statements in terms of whether they re- 
flected positive or negative attitudes toward parents 
in the present or the past; positive or negative atti- 
tudes of others toward their parents; impersonal 
theories about parent-child relationships; and 
whether they expressed concern with the interaction 
of group members and the discussion procedure. 
Each contribution to the discussion was categorized 
and recorded next to the name of the person who 
made it together with the length of the contribu- 
tion in seconds. Pauses which lasted for 20 seconds 
or longer were also recorded. In order to permit 
a detailed analysis of the discussion, the observations 
were divided into 3-minute sequential frames. 

Of particular relevance to the hypothesis being 
tested are those contributions which expressed exist- 
ing negative or positive attitudes that the group 
members have toward their own parents, since from 
these we can infer the degree to which there was a 
reduction in the inner restraint against expressing 
negative feelings. 

Experience in our preliminary experiments indi- 
cated that each contribution would have to be 
categorized as an entity. Frequently a statement 
would begin with the implication that the person 
loved his parents deeply and end with an explicit 
denunciation of them. The reverse also appeared 
quite often—the group member would begin to 
describe various hostilities he feels toward his par- 
ents, only to end with a highly favorable over-all 
estimate of them. Such examples made it clear 
that expressions of attitudes toward parents could 
be coded meaningfully only in terms of the con- 
tribution as a whole rather than in terms of specific 
and often contradictory statements within the con- 
tribution. When the observer could not make a 
judgment of the total unit, that is, whether the 
basic feeling revealed toward parents was positive 
or negative, she categorized it as “questionable.” 

To represent the degree to which inner restraint 
against expressing “hatred of parents” was reduced 
in the group, we calculated the difference between 
the number of contributions which expressed nega- 
tive attitudes (categorized as N) and the number 
of contributions which expressed positive attitudes 
(categorized as P). The number of P contributions 
was subtracted from the number of N contribu- 
tions because it was felt that P contributions were 
indications of the nonreduction of restraint. The 
larger the difference, the more successful the group 
had been in reducing restraint against the expres- 
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sion of negative attitudes toward their parents. 
Statements categorized as “questionable” were omit- 
ted from this calculation. Examples of statements 
falling into the two major observation categories 
follow: 

Negative Attitudes (N) 

“Frequently I get very angry at my mother and 
seemingly there’s a good reason; but I don’t get 
angry that way with others.” 

“There are times when my parents are so stub- 
born and bull-headed; they think they know best. 
Sometimes I don’t think so.” 

“No matter how much I try to think that my 
tolks are good to me, the fact remains that they've 
done me wrong.” 

Positive Attitudes (P) 

“I respect my father because he’s got a head on 
his shoulders; he’s more of a leader and a man.” 

“I feel toward my father that if I could be half 
the man he is, I'd be a great success.” 

“I respect my parents for understanding how im- 
portant independence is for the person.” 

The observer ® was trained intensively in pre- 
liminary experiments and in informal practice ses- 
sions. To check reliability, the experimenter inde- 
pendently observed one of the discussion groups. 
Calculating reliability by correlating the number of 
seconds of N in each of the 3-minute observation 
frames for the two observers yields a coefficient of 
.gt. A correlation could not be computed for P 
because, in that group, it occurred too infrequently. 
One would expect it to be of comparable magni- 
tude. 


The Recording of Statements 
To obtain a measure of Ss’ ability to identify 


who had said what in the discussion, the experi- 
menter recorded, as nearly verbatim as possible, 1o # 
statements made by the group members during the 
discussion. The following criteria were employed 
in selecting these statements from the discussion: 

1. The content of the statement should be distinct 
enough to permit identification of the person who 
made it, i.e., the statement should be as dissimilar 
as possible from those made by other group mem- 
bers. 

2. The statement should be about a sentence in 
length. 

3. The statement should be grammatically co- 
herent. 

4. The ro statements should come from as many 
group members as possible. 

5. The 10 statements should be distributed over 
the entire 40-minute discussion period. 

At the conclusion of the discussion, E made sure 
that the name cards were visible, and separated Ss 
to prevent copying. He then distributed a form and 
gave the following instructions: 

“I am going to read off some statements that were 
and some that were not 


made in the discussion 


8 Miss Dorothy Peterson, a graduate student in the 
Department of Sociology. 

4In 7 of the groups only 8 or 9 statements were 
recorded. This was occasioned by a lack of statements 
which fitted the criteria employed. 
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made. If you do not remember the statement hav- 
ing been made, place a check in the first column 
next to the appropriate number. If you remember 
the statement, but offhand you do not recall who 
made it, place a check in the second column next 
to the appropriate number. Finally, if you recall 
who made the statement write the first name of 
that person in the third column. This is not a mem- 
ory test of any kind, and there is no need to guess.” 
The statements were then read off in the same 
temporal order in which they were made in the 
discussion. Interspersed among the 10 statements, 
in constant order for all groups, were 5 statements 
that were not made in the discussion. These were 
included so as to provide a basis for comparing 
“Identification Errors,” i.e., errors in recalling who 
said what, with “Memory Errors,” i.¢., errors in 
remembering the conteat of the discussion. 

The Ss’ responses to the statements were scored 
in the following way. If, on any of the 1o state- 
ments actually made in the discussion, the person 
failed to recall who had made it or if he attributed 
the statement to the wrong person, he was given an 
error. The average number per person of these “Iden- 
tification Errors” was calculated for each group. 
Errors of general memory were calculated similarly: 
Whenever an S thought that a given statement had 
been made which actually had not or whenever an S 
thought that a given statement had not been made 
which actually had, he was given an error. As with 
“Identification Errors” these “Memory Errors” were 
averaged for the group. 

The E, in recording the statements, frequently 
had trouble meeting the criteria mentioned above. 
The statements recorded varied greatly in their 
identifiability. Sometimes a statement would be 
recorded and later on others would make very 
similar statements, thus making the identification 
ambiguous and difficult for Ss. Sometimes, when 
the discussion was proceeding rapidly the experi- 
menter would not be able to record the statement 
accurately and consequently the recorded statement 
would be quite different from what was actually 
said. To cope with these difficulties some of the 
statements were eliminated from the analysis when 
there were good grounds for believing they were 
poor statements. 

The specific criteria used to eliminate the state- 
ment were as follows: In groups of five persons or 
more a statement was eliminated if: (a) all or all 
but one S made errors on it, or (4) all but two 
made errors and the S who made the statement 
erred himself. 

In groups of four Ss a statement was eliminated 
if all or all but one, including the S who made the 
statement, made errors on it. 

When statements had been eliminated the aver- 
age number of “Identification” (7) and “Memory” 
(M) errors was corrected so as to make all groups 
comparable with respect to number of statements. 
The correction consisted of multiplying the number 
of ] and M errors, respectively, by 10 and 15 and 
dividing by the number of statements actually used 
in the counting of errors, i.¢., the number of state- 
ments recorded and not eliminated. 
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The measure used to represent the ability of the 
group to identify who said what was the average 
number of “/-errors” minus the average number of 
“M-errors.” The average number of “M-errors” is 
subtracted in order to correct for general memory 
level of the group. 


The Measurement of Attraction to the Group 


A postsession questionnaire included an item de- 
signed to measure the attractiveness of the group 
for tlhe members. The question and the possible re 
sponses are as follows: 

“Frankly, how nmuch would you like to return 
for further discussions of similar topics with this 
same group (assuming your schedule to be free)?” 

definitely want to return 

fairly strong desire to return 

feel neutral about it 

fairly strong desire not to return 

definitely do not want to return 
Numerical values were assigned to each alternative 
(1 for “definitely do not want to return”; 5 for 
“definitely want to return”) and an average attrac- 
tion score was computed for each group. 


Explanation to the Subjects 

In each group, after the questionnaire had been 
administered, E explained the purposes of the study 
in detail to Ss. They were told that the data pre 
sented for the discussion topic were entirely ficti 
tious and the reasons for using it, together with 
the reasons for the rest of the procedure, were dis- 
cussed with them. Sufficient time was spent in this 
manner with each group for them to leave with a 
good understanding of the experiment. They were 
also asked not to tell others about the experiment 
since we did not want future Ss to know what was 
going to happen in the group. As far as the ex- 
perimenters know, Ss faithfully kept silent about it. 


REsULTs 


There are two relationships with which we 
will be primarily concerned in examining the 
results of this experiment: (a) the relation 
between the frequency of negative attitudes 
toward parents revealed in the discussion and 
the ability to identify who said what, and 
(4) the relation between the frequency of 
negative attitudes revealed in the discussion 
and the attractiveness of the group for its 
members. After examining the evidence on 
these two points we will look at possible 
alternative explanations of the data and evi- 
dence for or against such alternatives. 


De-Individuation 
Restraint 

It will be recalled that our measure of de- 

individuation in a group was the extent to 

which the members of the group were unable 


and Reduction of Inner 
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to identify who said what during the discus- 
sion (/-errors—M-errors). The measure of 
the extent to which inner restraints were 
reduced is the frequency of negative attitudes 
toward parents revealed in the discussion 
(N—P). Figure 1 shows the scatter diagram 
of the obtained relation between these two 
measures. From the theory we elaborated 
above we would expect to find them posi- 
tively correlated. 
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Fic. 1. RELATIONSHIP BETWEEN REDUCTION IN RE- 
STRAINT AND ABILITY TO IDENTIFY WHo Sain WHaT 


Figure 1 shows the scatter diagram of the 
obtained positive correlation between these 
two variables. The correlation, including all 
of the groups, is only .22. One of the groups, 
however, indicated on the figure by an arrow, 
is considerably off the scale on poorness of 
identification of who said what. There are 
grounds for believing that this group was 
affected by a very different factor, namely, 
disinterest in the experiment and in the 
discussion. 

The major grounds for asserting this are 
the great number of pauses in the discussion 
for this group. Observing pauses only of 
20-second duration or longer, this group had 
a total of over 5 minutes of complete pauses 
during the 40-minute discussion. No other 
group had pauses totaling more than one 
and a half minutes. Most of the groups had 
no pauses at all lasting as long as 20 seconds. 

If we can take this as indicative of disin- 
terest and, consequently, attribute the poor 
memory in this group to disinterested inat- 
tentiveness to people, then, considering how 
far off the scale of the other groups it is, it 
may be legitimate to omit this group from the 
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calculations. Omitting this group, the correla- 
tion between the two variables in Figure 1 is 
57- This correlation is significant at the .o1 
level of confidence. Our further presentation 
and discussion of the data will omit this 
deviant group. 

It is also instructive to examine the rela- 
tions between the measure of reduction in 
restraint and the /-errors and M-errors sep- 
arately. We would expect the reduction in 
restraint to be positively correlated with the 
average number of /-errors alone, although 
this correlation should be lower because of 
the uncontrolled general memory factor 
which enters. The correlation obtained be- 
tween N—P and [errors is .21. 

Perhaps a more accurate way to eliminate 
the general memory factor from this corre- 
lation would be to calculate the partial 
correlation of N—P with [errors, holding 
M-errors constant. Table 1 shows the inter- 
correlations among the three variables in- 
volved in this partial correlation. 


TABLE |! 


aMONG /-Errors, M-Errors 
anp N—P 


INTERCORRELATIONS 


M-Errors 


I-errors +24 -31 
M-crrors —.39 





The partial correlation of N—P with /-errors, 
holding M-errors constant, is .45. 

It is interesting to understand why the 
measure of reduction in restraint (N—P) cor- 
relates negatively with the number of M- 
errors. This is probably due to the fact that 
the less the members of a group revealed 
negative attitudes and the more they tended 
to skirt the real discussion topic, the less dis- 
tinctive were the statements which E was 
able to record verbatim during the discussion. 
Consequently, the greater the number of 
negative contributions, the better were they 
able to recall what was said. It is also pos- 
sible that the greater the extent to which 
negative attitudes were revealed in the dis- 
cussion, the more attentive were the members 
to what was being said and, consequently, 
the more adequate their memory. It then 
appears that an increase in the expression of 
negative attitudes toward parents is accom- 


panied by an increase in the inability to 
identify who said what, in spite of a general 
improvement in memory. 


Reduction of Inner Restraint and Attraction 
to the Group 

Since the reduction of inner restraints 
allows the group member to behave more 
freely and to satisfy needs which would 
otherwise be difficult to satisfy, groups in 
which reduction of restraint occurs should 
be more attractive to their members. We 
should then expect to find a positive corre- 
lation between the measure of the reduction 
of restraint (N—P) and the average attrac- 
tion to the group as measured on the postses- 
sion questionnaire. This correlation turns out 
to be .36, which is significant at almost the 
10 per cent level of confidence considering 
both tails of the probability distribution. 

There is evidence, then, supporting the two 
major derivations stemming from the theory 
about de-individuation in a group, namely, 
that it does tend to result in the reduction of 
inner restraints and that its occurrence does 
tend to increase the attractiveness of the 
group for its members. 


Possible Alternative Explanations 

In connection with any set of data for 
which a specific theoretical explanation is 
given the question arises: are there alterna- 
tive and perhaps simpler explanations of the 
data? This question is especially pertinent 
in connection with a study such as the present 
one since a relatively new theory is being 
presented. We will consequently present 
possible alternative explanations which vari- 
ous persons have suggested to the authors 
and discuss whether or not these explanations 
are compatible with the data. 

1. Can the results be explained in terms of 
theories of repression or selective forgetting? 
The Ss have been put in a situation where 
they were virtually forced to reveal attitudes 
which they perhaps considered shameful. It 
is plausible to expect that under these con- 
ditions they would tend to repress the shame- 
ful material and, consequently, the more 
shameful material is expressed in a group, the 
poorer is their ability to identify who said 
what. 

It seems to the authors that any theory of 
repression would predict that the content of 
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what was said would be forgotten as well as, 
or even sooner than, who said it. The data, 
however, indicate that the more “shameful 
material” expressed in the group the Jdetter 
is the members’ memory for what was said. 
It is only the ability to identify who said 
what which is worse. It would seem further 
that if something like “shame” were the 
determining variable, those groups which 
produced more sh meful content would be 
least attractive to their members. The data 
show exactly the opposite result. One may 
conclude that a theory of repression cannot 
adequately explain the results. 

2. Can the results be explained in terms of 
the Ebbinghaus laws of forgetting? Those 
groups which followed the experimental in- 
structions best stated more instances of nega- 
tive attitudes towards their parents and said 
more things of a personal nature. This 
means they produced more statements of the 
kind that E recorded verbatim and tested 
them on. From experiments on forgetting 
we know that the more statements there 


are to be remembered, the poorer will the 
memory be. 
This type of explanation again would 


predict poorer memory for what was said in 
those groups who revealed a lot of negative 
attitudes toward their parents. This is con- 
trary to the facts. It would also seem that 
this explanation cannot handle the obtained 
positive relation with the attractiveness of the 
group. 

3. Can the results be explained in terms of 
division of attention? It is plausible to sup- 
pose that in those groups where more nega- 
tive attitudes were revealed, the members 
became correspondingly more interested and 
engrossed in the content of the discussion. 
Since they were paying so much attention to 
what was said they could not pay attention 
to who was saying it. This theory would 
account for the fact that there is a negative 
correlation between the frequency of nega- 
tive attitudes revealed and number of M- 
errors while a positive correlation exists 
between the former variable and number of 
Lerrors. This theory would also explain the 
relation with attraction to the group. Those 
groups in which more interesting discussions 
took place were more attractive to their 
members. 
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This theory has one additional implication. 
it would imply that one should obtain a size- 
able negative correlation between the number 
of J-errors and the number of M-errors. This 
should certainly be the case if it is a matter 
of division of attention and the more one 
pays attention to content the less attention is 
it possible to pay to who is saying what. The 
actual result is that there is a correlation of 
.24 between these two kinds of errors. This 
would seem definitely to refute the division 
of attention explanation. 

4. Can the results be explained simply in 
terms of individual reactions rather than as 
a group phenomenon? Perhaps those indi- 
viduals who revealed negative attitudes 
toward their parents tended to ignore who 
was speaking because of the shameful nature 
of the content. 

This possible explanation can be refuted 
by an analysis of the data in terms of indi- 
viduals within each group. If it is a matter 
of individual reaction, then, within each 
group, the same relationship should hold that 
we find when we use the group as the unit 
of analysis. Accordingly, we ranked the 
members of each group on the basis of the 
extent to which each member made contri- 
butions to the discussion which revealed 
negative attitudes toward their parents 
(N—P). We then examined the measure of 
ability to identify who said what (J-errors— 
M-errors) in relation to these ranks. 

If we split each group in half on the basis 
of the N—P measure there is no consistent 
difference between the upper and lower 
halves on the ability to identify who said 
what. In 10 of the groups those who con- 
tributed more negative statements made more 
identification errors and in the other 12 
groups the relationship is reversed.® In other 
words, the analysis on an individual basis 
reveals no relationship at all between the two 
measures. 

It might be argued that dividing each 
group into two parts is a rather gross analysis 
and might obscure an existing relationship. 
We consequently also analyzed the data 
dividing each group into four parts. This 
required some arbitrary decisions as to the 

5 This analysis continues to exclude the extremely 


deviant group. If this group were included it would 
make 13 groups where the relationship was reversed. 
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division when there were actually more than 
four N—P scores in a group, but these deci- 
sions could be made fairly easily. When 
there were only three different N—P scores 
in a group, they were analyzed as belonging 
to the top three quartiles when groups were 
combined. When there were only two dif- 
ferent N—P scores in a group they were 
analyzed as belonging to the middle two 
quartiles. In this manner all groups could 


be combined and the average calculated for 
each quartile of J-errors minus M-errors. 
The following tabulation shows these data: 


Quartile division Average 
on N—P N_ /Lerrors—M-errors 
highest quartile 25 -32 
second quartile 32 —.42 
third quartile : -44 
fourth quartile 23 -37 
It is clear that there is no relationship be- 
tween the two measures when analyzed in 
this manner. The variation within any 
quartile is quite large and none of the differ- 
ences even approach statistical significance. 
It might also be argued that this analysis 
does not dispose of the notion that it may be 
individual reaction rather than a group phe- 
nomenon because this analysis obscures the 
absolute magnitude of the N—P measure 
which may be important. The data were 
accordingly also analyzed by individuals 
simply on the basis of absolute amount of 
N—P. The following tabulation presents the 
results of this analysis: 


Absolute score Average /-errors 
on N—P N -—M-errors 
greater than zero 42 .27 
zero 2 04 
less than zero 43 25 


None of these differences are appreciable or 
statistically significant. Once more it is clear 
that analyzing the data by individuals does 
not show the same relationship which was 
found when the data were analyzed by 
groups. This certainly lends support to the 
theory which explains the results as a group 
phenomenon. 


SUMMARY 


A group phenomenon which we have 
called de-individuation has been described 
and defined as a state of affairs in a group 
where members do not pay attention to other 
individuals gua individuals, and, correspond- 
ingly, the members do not feel they are being 
singled out by others. The theory was ad- 
vanced that such a state of affairs results in 
a reduction of inner restraints in the mem- 
bers and that, consequently, the members 
will be more free to indulge in behavior from 
which they are usually restrained. It was 
further hypothesized that this is a satisfying 
state of affairs and its occurrence would tend 
to increase the attactiveness of the group. 

A laboratory study was conducted to test 
this theory and the data from this study tend 
to support it. Other possible explanations of 
the obtained results have been considered and 
found inadequate. 
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AGE AND SEX DIFFERENCES ON THE RORSCHACH EXPERIENCE 
BALANCE 


WILLIAM HAYS 


Lincoln State School and Colony 


GREAT deal of study has been given re- 

cently to the problem of delineating 

the “normal” Rorschach protocol, the 
ultimate aim being a fuller understanding of 
the nature of the deviancy so characteristic of 
most pathological states. It is generally ac- 
knowledged that a great many factors may 
influence the range of deviation we should 
expect of the “normal” Rorschach protocol. 
However, little has been reported concerning 
the effects of the sex and the age of the sub- 
ject upon the normal variations to be found 
in his Rorschach record. 

Rorschach (7) suggested that some varia- 
tion was to be expected in the course of the 
individual's life span, notably in his experi- 
ence type. He indicated that, although evi- 
dence was minimal, some differences were to 
be seen between the sexes in their approach 
and experience-type in the test. Klopfer and 
Kelley (6) pvint out the possibility of varia- 
tion based upon both age and sex of the sub- 
ject, and admit the lack of large-scale investi- 
gations which would explore these differ- 
ences. Ford (3) has made an extensive study 
of Rorschach characteristics of young chil- 
dren, and Hertz (4, 5) has done notable 
study of adolescent records. The most com- 
plete study of an average “healthy” sample 
of adults is that of Beck (1). However, 
Beck’s study is not concerned with age and 
sex variation in the average group. 

There are undoubtedly two very important 
reasons for this lack of evidence concerning 
age and sex differences in the average adult 
Rorschach. One is the manifest difficulty in 
securing large, relatively homogeneous groups 
of adults having wide range of age variation. 
A second, and perhaps more important, is 
the extreme difficulty in making comparisons 
between groups on Rorschach factors. This 
has been true especially of complex factors 
such as experience type, manner of approach, 
etc., which do not lend themselves readily to 
the usual statistical treatments. 

A technique of pattern tabulation has been 


developed recently by Cronbach (2), which 
has the advantage of being especially suited 
for the statistical treatment of Rorschach ex- 
perience balance and similar complex scores. 
This technique permits the comparison of 
groups in terms of the distribution of pat- 
terns of scores; thus, the results can be made 
more psychologically meaningful than can 
simple comparison of groups in terms of 
mean scores or other measures. 

It is the purpose of this investigation to 
apply the technique of pattern tabulation to 
a study of variation in Rorschach experience 
balance with age and sex, in a group of nor- 
mal adults of homogeneous intelligence and 
occupational levels. 


PROCEDURE 


During the three-year period from 1947 to 1950, 
all new employees of the Lincoln State School and 
Colony were given the Rorschach and the Wechsler- 
Bellevue Intelligence Scale. From these employee 
files were drawn the records of 246 individuals who 
were applying for jobs as institutional worker, at- 
tendant, or clerical worker, and who showed a 
Wechsler IQ between 95 and r11. Each record had 
been given and scored by one of three psychologists, 
each of whom had wide experience in Rorschach 
scoring and administration, and who had worked 
closely together during that period. Beck scoring 
was used throughout. In order to lessen the possi- 
bility of different proportions of the various sex and 
age groups having been tested by the different psy- 
chologists, the number of male and female subjects, 
and of subjects over and under 50 years tested by 
each examiner were drawn into contingency tables. 
Chi-square from these tables showed no significant 
deviation from chance for the number of subjects 
of each sex or age tested by each examiner. 

The age range represented in the 246 subjects was 
18 to 65 years. However, it was decided to di- 
chotomize the age range into subjects above and 
below 50 years of age. This age was taken as the 
point of dichotomy since it is mentioned by Ror- 
schach as period of probable change in the experience 
type. Thus the group was subdivided into 88 
females below 50 CA, 71 males below 50 CA, 45 
females 50 or over, and 42 males 50 or above. 
Table 1 present CA characteristics of these groups. 

Cronbach's method of pattern tabulation involves: 
(a) taking three scores constituting a meaningful 
pattern; (4) transforming each score by means of 
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Group 


(F)<50 
(M)<50 
(F)>50 
(M)>s50 
Total 


AcE AND Sex DirFERENCEs ON RorscHacH EXPERIENCE BALANCE 


TABLE | 


CA CHARACTERISTICS OF THE GROUP 


Mean CA 


29.65 
31.86 
54.4 
56.2 

4 


39 


SD CA 


p or Dirr. 





-910 
-720 
+275 
825 
10.361 


Not sig. 


Not sig. 





a percentile distribution into a normalized scale; 


(c) averaging the three normalized scores for 
each person; (d) subtracting the average from each 
of the normalized scores to obtain “profile scores” 
for each person; (¢) plotting each set of profile 
scores on a triangular homogencous-coordinate dia- 
gram; (f) comparing distributions by chi-square for 
various areas.! 

After the pattern scores had been plotted for the 
entire group, the frequency of the occurrence of 
points belonging to members of the four age and 
sex subgroups was recorded for certain areas. For 
this purpose, the triangular coordinate graph was 
divided into four large areas: Area I, showing an 
emphasis upon M in the experience balance; Area 
II, showing a constricted experience balance with 
emphasis upon F responses; Area III, showing an 
emphasis upon C responses; and Area IV, showing 
equal emphasis upon M,C, and F. The distribution 
of the individuals of the various age and sex sub- 
groups is shown in Table 2. Individual cases falling 
on dividing lines for areas were given a frequency 
of one-half for each area. 


1 The reader is referred to the paper by Cronbach (2) 
for a detailed explanation of this method. 


TABLE 2 


Frequency oF Cases IN Eacu AREA 








FEMALE MALE 





AREA <s50CA >s50CA <50CA >50CA Tora 





I Intro. ‘ . . 15 
II Constr. 16.5 
Ill Extra. 
IV Bal. 

Total 
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Before statistical treatment of the frequency of 
occurrence in each area was undertaken, it was 
necessary to examine the differences according to 
age and sex in the general factor of responsiveness. 
Table 3 presentes these mean response characteristics 
by age and sex. The ¢ for each pair of mean dif- 
ferences was not significantly different from chance. 

The technique of chi-square was utilized in the 
comparison of the age and sex subgroups for each 


TABLE 3 


MEAN Tortat RESPONSES 


SD R 


Group 


FOR THE 


Group Mean R p or Dirr. 





14.850 
13.003 
15.051 
12.604 
14.365 


Females 25.95 
Males 23.05 
CA<50 26.20 
CA>50 22.80 

Total 25.00 


>15<20 


>5<10 





area of emphasis. For this purpose 2X2 frequency 
tables were employed. For example,*in Area I 
the dichotomy of Males-Females was taken against 
number of cases in I, number of cases not in I, etc. 
It will be noted from Table 4 that highly sig- 
nificant age differences occurred in Area II (con- 
stricted), Area IV (balanced), while significant sex 
differences were seen in Area III (color emphasis), 
and somewhat less so in Area I (M emphasis). 


Discussion 


The analysis of the patterning of experi- 
ence balance scores for this group seems to 
indicate that there is a marked tendency for 
subjects over 50 years of age to show a con- 
striction of the experience balance and a rela- 
tively heavy emphasis upon F responses. 
Further analysis shows that this trend is 
more pronounced among the female subjects 
over 50 than among the males over 50. It is 
interesting to note Rorschach’s (7) suggestion 
that after age 40, introversive features tend 
to diminish, with progressive trends toward 
coarctation with increasing age. The pres- 
ent finding would tend to support the pres- 
ence of some such trend. However, the pres- 





TABLE 4 


Prospasitity Levers For x2 in CoMPARISONS OF SUBGROUPS 








MXF <soX>50 MXF>s50 MXF<50 


<soX>50 <soX>50_ 
M F 





II Constr. 
Ill Extro. 
IV Bal. 


Ns Ns <s% T Ns 
Ns <1% Ns Ns 
<3% Ns Ns Ns 
Ns <I Ns 


<5s% 
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ent study tends to indicate constriction in the 
female subjects counterbalanced by consid- 
erable introversive inclination in the male 
subjects. It is not impossible that, if this 
trend exists, it has differential effect and rate 
between the sexes. This finding can prob- 
ably be interpreted in the light of the earlier 
onset of the menopause and involutional 
period in the female than the corresponding 
period in the male. Perhaps this trend toward 
constriction in the female subjects over 50 is a 
reflection of postmenopausal involutional 
tendencies in some of the subjects. It would 
be of considerable interest to trace this sex 
difference into the range of senility in order 
to determine if the contriction trend continues 
to operate in both sexes. 

The infrequency of subjects showing a bal- 
anced experience type after age 50 is probably 
a function of the shift toward the constricted 
and the introversive types. This is not the 
case with the groups under 50, however, who 
seem to show a relatively constant dispersion 
over the various areas, with the exception of 
Area III. 

It will be noted that only 48.5 cases fell 
in the “balanced” or “ambiequal” category of 
Area IV. This comprises only 19.7 per cent 
of the entire group. Even if 19 more cases 
found to have a “dilated” experience balance 
are added to this number, only 27.4 per cent 
of the cases are accounted for. This contrasts 
sharply with the normal “Spiegel” sample of 
Beck (1), which is characterized as showing 
“ambiequal, moderately dilated experience 
balance” and a “fluid, resilient psyche.” How- 
ever, Beck’s procedure of generalizing from 
an experience balance formed from the mean 
M and C-sum of his group is questionable 
in the extreme. 

In Area III there appeared to be both a rela- 
tive overfrequency of female subjects, and an 
underfrequency of male subjects. This sug- 
gests some tendency for the overemphasis of 
the experience balance on color to be more 
characteristic of females. Once again, Ror- 
schach has assumed that the number of C 
responses will be somewhat larger in female 
subjects than in males, and the present find- 
ing tends to support this conclusion. No cor- 
responding emphasis in another direction is 
to be seen for the male subjects in this study. 


WituiaM Hays 


Further analysis of our data tends to sug- 
gest that this sex difference in color emphasis 
is more characteristic of subjects under 50 
than those above 50. This trend, however, 
falls slightly short of the level required for 
statistical significance. This sex difference 
might be a function of the fairly heavy 
weighting of the group of females under 
50 with women presumably at early stages 
of the menopause. 

While this study has obvious shortcomings, 
it is felt to show the influence of trends op- 
erating in the variation of experience balance 
with age and sex differences. Furthermore, 
it demonstrates the application of a statistical 
device to a comparison of Rorschach scores 
hitherto impractical for lack of suitable treat- 
ment. It is hoped that further work will be 
done utilizing this and other techniques in 
more detailed analyses of basic Rorschach 
problems, particularly that of exploring the 
characteristics of the “normal” Rorschach 
record. 


SUMMARY 


1. Rorschach records of 246 adults of nor- 
mal intelligence were taken from the em- 
ployee files of Lincoln State School and 
Colony. These were subgrouped according 
to sex and age. 

2. Experience balance of these records was 
analyzed according to the method of pattern 
tabulation. 

3. A significant trend was noted toward 
constriction in subjects over 50 years of age, 
particularly among female subjects. An in- 
troversive trend was also found for male 
subjects beyond 50 years. 

4. Significantly fewer subjects of either sex 
over 50 years showed a balanced experience 
type than did subjects under 50 years. 

5. There was a significant trend for higher 
incidence of color emphasis among female 
subjects. Male subjects below 50 years of 
age showed no marked trends. 

6. It is concluded that experience balance 
dees show variation with both sex and age. 
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HE study by Postman, Bruner, and Mc- 

Ginnies, entitled “Personal Values as 

Selective Factors in Perception” (4), is 
of considerable theoretical significance. They 
conclude that individual value preferences 
are significantly associated with recognition 
time for words presumably representing those 
values. 

The present paper has two purposes: (a) 
to compare the results obtained by using a 
direct measurement of value preferences with 
those obtained by the indirect method used 
by Postman, Bruner, and McGinnies, which 
involves scores on the Allport-Vernon Study 
of Values; (4) to repeat the study of Post- 
man, Bruner, and McGinnies (hereafter re- 
ferred to as PBM) with the addition of a 
more satisfactory statistical procedure. 

A current view in psychology maintains 
that human behavior can be most effectively 
understood and predicted in terms of factors 


operant within the phenomenological field of 
the individual (5s). This field is seen to be 
organized around the phenomenal self. The 
assumption is made that “once the phenom- 
enal self has become established, experience 
thereafter can only be interpreted in terms of 


that self. Thus all perceptions which are 
meaningful to the individual derive their 
meaning from the relation they bear to the 
phenomenal self already in existence” (6, p. 
85). 

If these assumptions are true, then how 
does the phenomenal self influence the or- 
ganization of the total phenomenal field? 
Solely through the processes of accentuation 
and fixation, i.e. by inducing differential 
focusing upon the data immediately present 
in the field of awareness, both from extero- 


1 This research was supported in part by a Counseling 


Center Research Fellowship, University of Chicago. 


of 


394 


FISKE 


Chicago 


ceptive sources and from memory? Or could 
it be that the phenomenal self exercises some 
influence at an even prior stage in the per- 
ceiving process, during the selection which 
determines which data shall enter the field of 
awareness? 

We are asking here a question which is 
very simiiar to the one which is raised in 
the PBM study. Each of these questions 
leads to an investigation of possible relation- 
ships between the factors which have been 
posited as central determinants of personality, 
and the exteroceptive perceptual behavior of 
the individual. Postman, Bruner, and Mc- 
Ginnies find a partial answer to the question 
when they conclude from their study that 
there is a significant association between 
personal value preference as measured by the 
Allport-Vernon Study of Values, and the 
recognition time for words associated with 
these values. Thus, they show a relationship 
between one kind of central personality de- 
terminant (value preference) and one kind 
of exteroceptive perceptual behavior (recogni- 
tion of rapidly exposed words). 

Because of the theoretical significance of 
these Postman, Bruner, and McGinnies find- 
ings, and because a preliminary study by one 
of the present authors failed to corroborate 
their results,” it was decided to repeat their 
study. 


Tue ExpERIMENT 


Twenty-seven subjects, volunteers from an intro- 
ductory psychology class at the University of Chi- 
cago, were shown thirty-six words, one at a time, 
in a Harvard-Dodge tachistoscope with illumination 


2 This preliminary study is ambiguous in outcome and 
is merely of historical interest here because it explains 
why the present study was undertaken. This study is 
reported in Haigh’s thesis (2, pp. 86-98). 
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fixed at approximately .6 foot-candles.2 The words 
were originally selected by PBM to represent the six 
value categories measured by the Allport-Vernon 
Study of Values. They are listed by value category 
in Table 1. 

These words were shown to the subject in ran- 
dom order, using the same procedure as described 
by PBM. To obtain an independent measure of per- 
sonal value orientation for these words, the Allport- 
Vernon Study of Values was administered to each 
subject within four weeks after tachistoscopic pres- 
entation. 

A second method for measuring value preference 
was used in the present study, but not in the PBM 
study. At the same time the subjects took the All- 
port-Vernon test, they were asked to rank the stimu- 
lus words in the order of their liking for them. The 
procedure was defined as follows: 


1. All words were to be put into nine piles, with the 
words liked the most at one extreme, and the words 
liked the least at the other extreme. 
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There are, thus, two intermediate assumptions 
involved in using this method of value preference. 
On the other hand, this indirect procedure posits a 
set of general values, an assumption which has 
wider theoretical implications than those extending 
from the simple indication of relative preference for 
single words. Thus, if the results utilizing the in- 
direct method are confirmed by those from the 
direct procedure, the assumptions are justified. 
Even more important, such corroboration would in- 
dicate that the indirect method of measuring pref- 
ences could safely be used in investigations where 
the true purpose had to be concealed from the 
subjects. 


ANALYsIS OF RESULTS 


Indirect method of measurement. Is time 
of recognition significantly influenced by the 
value which a given stimulus word repre- 
sents? As in the PBM experiment, each 


TABLE 1 


STIMULUS 


THEORETICAL Economk AESTHETIC 


Worps REPRESENTING THE Six ALLPORT-VERNON 


VaLue CATEGORIES 





SociaL PoLiTIcaL RELIGIOUS 





income 
useful 
wealthy 
finance 
economic 
commerce 


theory 
verily 


beauty 
artist 
poetry 
elegant 
literary 
graceful 


science 
logical 
research 
analysis 


prayer 
sacred 
worship 
blessed 
religion 
reverent 


lov ing 
kindly 
devoted 
helpful 
friendly 
sociable 


govern 
famous 
compete 
citizen 
politics 
dominate 





2. The words in each pile were to be put into rank 
der, with the one liked most at one extreme, and the 
one liked next beside it, and so on, The net result 
was a rank ordering of all thirty-six words with refer- 
ence to the subject's preferences, i.e., the extent to which 
the subject liked each word in relation to all the words. 


The direct method was introduced after consid- 
eration of the circuitous connection in the original 
design between the subject’s value preferences and 
the stimulus words. To review this indirect pro- 
cedure, the subject is first asked, via the Allport- 
Vernon, to make choices describing his own be- 
havior, or indicating his interests. Each of these 
choices is then assumed to be associated with one 
of the six Spranger value categories, and thus a 
measure for each of these categories is obtained by 
the proportion of choices which are found to fall 
within it. Each of the stimulus words is also 
assumed on the basis of unanimous classification by 
a group of judges to be associated with a particular 
Spranger value category. 


8 Actually, they were shown 73 words, but the other 
group of words refers to a separate aspect of the study 
which will not be reported on here. That this other 
aspect of the study did not affect the part reported on 
here is suggested by the similarity of results here pre- 
sented to those obtained in the PBM study. For details 
on this other aspect of the study, see Haigh (2). 


subject’s value profile was compared with his 


time-of-recognition profile. These psycho- 
graphs were found to be quite similar to the 
ones which were plotted from the PBM re- 
sults. Visual inspection suggests that in this 
sample, as in the PBM sample, time of recog- 
nition varies as a function of value. 
Following the statistical treatment em- 
ployed in the PBM study, the value scores 
of each subject in this study were classified as 
falling above or below the population mean 
for the Allport-Vernon test. His time-of- 
recognition scores were similarly divided into 
those falling below or above his own mean 
time of recognition. Combining the results 
for all subjects into a two by two contingency 
table, a chi-square test of independence was 
performed. The obtained chi-square value of 
9.35 is quite close to the value (11.87) ob- 


4 Because the evidence of these psychographs is merely 
suggestive, but in no sense conclusive, they are not 
reproduced in the present paper. They may be found in 
the Haigh thesis (2). 
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tained by Pustman, Bruner, and McGinnies. 
There is a striking similarity between the 
contingency table constructed from these data 
(cf. Table 2) with the table constructed from 
the PBM data. If this statistical method for 
handling the data were adequate, then these 
results would show that the association be- 
tween value orientation and time-of-recogni- 
tion is not random. 

Critique of statistical procedure. The re- 
sults presented thus far agree with those of 
Postman, Bruner, and McGinnies. However, 


it must be noted that the fundamental statis- 
tic involved both in the PBM study and in 


the above analysis is the chi-square test of 
independence. The psychographs are merely 


most common error detected by them in stud- 
ies employing chi square. 

It will be recalled that in the PBM study 
and in the present study the chi-square values 
were obtained by combining the recognition 
times for all subjects, making for a total chi- 
square value as contrasted with a pooled chi- 
square, in which chi-square values are com- 
puted for each subject and then added. Lewis 
and Burke say: “Whenever individual sub- 
jects each make more than one response per 
trial, linkages among the measures within 
the trial must result unless there are no indi- 
vidual differences” (3, p. 443). Visual inspec- 
tion of our data reveals that there are indi- 
vidual differences as to range of recognition 


TABLE 2 


Cu1-Souare Test oF SIGNIFICANCE OF ASSOCIATION BETWEEN VALUE PREFERENCE AND TIME OF RECOGNITION 
(Theoretical frequencies are in parentheses) 








Vatue Scores 





AsoveE MEAN Betow MEAN 





Above mean 


Time of recognition 
— Below mean 


167 211 
(185.6) (188.4) 
282 235 
(259.4) (257.6) 
449 446 
P<.01 





suggestive, certainly not conclusive, in point- 
ing to a relationship between value and per- 
ceptual orientation. The only significant dif- 
ferences obtained in the PBM study among 
the recognition times for value ranks were 
those between rank 1 and rank 6, and between 
rank 1 and rank 5. It is possible, in the ab- 
sence of an analysis of variance, that these 
differences might have been derived from the 
tails of a random distribution—that, in other 
words, the relationship can be explained on 
the basis of chance. The conclusion that 
value is a determinant of perceptual response 
rests primarily upon the evidence furnished 
by the chi-square test of independence. How- 
ever, the use of total chi square as employed 
in the present study and in the PBM study 
seems to be questionable. Lewis and Burke, 
in a paper on the use and misuse of the chi- 
square test (3), cite lack of independence 
among the single events or measures as the 


times. (Statistical corroboration of these in- 
dividual differences is supplied below in the 
results of the analysis of variance.) These 
individual differences lead to dependence 
among the events measured, and so violate 
an assumption required for the use of chi- 
square. 

Two other methods for analyzing these 
data are possible. (a) Chi square could be 
computed for each subject, and the resulting 
values could be pooled. One objection to this 
technique is that the chi square for each 
subject would be based on a total N of 36 
which would yield rather small theoretical 
frequencies for each cell. A variation on this 
approach would be to use the Fisher-Yates 
exact method (1, pp. 96-97) to obtain a prob- 
ability value for each subject; these probabil- 
ities could then be combined by means of chi- 
square (1, pp. 99-101). (4) An analysis of 
variance can be performed to test the effect of 





PersoNaL VaLues As SELECTIVE Factors IN PERCEPTION 


TABLE 3 


ANALYSES OF VARIANCE OF RECOGNITION TIMES 








A. INDIRECT MEASUREMENT OF VALUE 
(using Allport-Vernon) 


B. Direct MEASUREMENT OF LIKING 





Source of variance F Pp 
Subjects 55.64 
Values 5-19 
Subjects X values 1.05 


-OOI 
-OO!I 


>.-05 





Source of variance F 
Subjects 55-50 
Liking rank (by sixths) 3-33 


Subjects X ranks 1.59 >.05 





value preference upon recognition times. 
This alternative was used in the present 
study. 

Results of analysis of variance. The recog- 
nition times were arranged in terms of values 
as measured by the indirect method, and an 
analysis of variance was carried out. The re- 
sults, shown in Table 3a, indicate that there 
are marked individual differences in mean 
recognition times. More important is the 


corroboration of the fact that recognition 
time is influenced by value preferences. 
Direct method of measurement. The same 
recognition times for each of the 36 stimulus 
words were reanalyzed in terms of each 
subject's rank order of preference. To make 
the results more comparable with the preced- 


ing analysis, each subject’s rank order was 
divided into six groups of six words each: 
words ranked 1-6 were put in the first group; 
words ranked 7-12 in the second, etc. 

An analysis of variance was performed on 
these data, with the results shown in Table 
38. The contributions of subjects and of 
value groups are highly significant. In 
neither of the analyses reported in Table 3 
is the interaction variance significant. 

Since each analysis of variance indicated 
that value preferences were associated with 
recognition times, the various pairs of means 
in each analysis (cf. Table 4) were tested by 


the ¢-test. The results are shown in Table 
5. The major finding is that the recogni- 
tion time for least preferred words tends to be 
significantly longer than the time for other 
words. 


CoMPARISON AND Discussion 


The present analysis of the relationship be- 
tween recognition time and value preferences 
has yielded results similar to those found in 
the PBM study. Positive values tend to be as- 
sociated with shorter recognition times. The 
agreement between the results of the two 
studies is of great importance, especially in 
view of the level of statistical significance of 
the present findings. 

Certain differences between the studies 
may be noted, however. The range of the 
PBM means is twice that of our means. Also 
the slowest mean recognition times in our 
study (.064 and .066) are faster than the 
fastest mean time (.075) in the PBM data. 
Furthermore, the means reported by PBM 
suggest that the relationship between prefer- 
ence and recognition time is most marked for 
the most highly preferred value, whereas the 
present study suggests that the relationship 
is most marked for the Jeast preferred value. 
These trends apply only to the range of value 
preferences sampled; more extreme values 
might show different trends. 





TABLE 4 


Mean REeEcocniTion Time sy VAtue Grovupincs 








A. INDIRECT MEASUREMENT 
Rank OrpeR oF ALLPORT-VERNON 
VALUES BY PREFERENCE 


Bs. Drrect MEASUREMENT 
Rank Orper or Expressep Lrxincs 
FoR SEPARATE Woros 





Mean recognition time 
(in seconds) 


-058 .055 


4 


-057. .060 .060 .064 


I 2 3 4 5 6 
(1-6) (7-12) (13-18) (19-24) (25-30) (31-36) 
-054 .056 .@59 «061 -060 .066 
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TABLE 5 


SIGNIFICANCE OF DIFFERENCES BETWEEN MEANS FOR 
VaLuc GROUPINGS 


1a Indirect 
b Direct 
2a Indirect 
b Direct 
3a Indirect 
b Direct 
ga Indirect 
b Direct 
sa Indirect 
Direct 


* Significant at the .05 level. 
** Significant at the .o1 level 
*** Significant at the .oo1 level. 


The most plausible way to account for these 
differences seerms to be in terms of the in- 
structions to the subjects. In the present study, 
when subjects seemed to be waiting until 
reasonably certain of the presented word, 
they were urged to guess. Apparently in the 
PBM study subjects were merely instructed to 
report what they saw without being urged 
during the experiment. It is likely that the 
faster reaction times in our data are asso- 
ciated with their reduced variation. 

Of particular interest is the comparison of 
the two methods of measuring values. The 
direct method yields a clearer trend in the 
group means, with a larger range. However, 
the similarity in the over-all trends indicates 
the acceptability of the two assumptions in- 
volved in the indirect method. Furthermore, 
these results suggest that the indirect method 
can be used with confidence under conditions 
where the purpose of the research must be 


hidden.® 


5 Since this paper was written two studies (5, 7) have 
indicated a relationship between the frequency of a given 
word in material, and the duration threshold 
(recognition time) for that word. The paper by Post- 
man and Schneider (5) also shows that the PBM finding 
holds for relatively infrequent words, but not for rela 
tively frequent words. The mean log frequency for the 
words used by PBM and the present authors is 2.10 
(using the Thorndike-Lorge “L” list [8]), which is 
close to the mean value for the Postman-Schneider list, 
but considerably above that for the Solomon-Howes list. 

It may be worth noting that these two most recent 
obtained duration thresholds considerably above 
of either the PBM study, or that reported here 


reading 


papers 
those 


SUMMARY 


This study examined the effect of value 
preferences for 36 words upon time required 
to recognize these words when presented 
tachistoscopically. The value preferences were 
measured in two ways. (@) Each subject di- 
rectly expressed his preference by ranking the 
words. (4) Each subject completed the All- 
port-Vernon Study of Values; since the word 
list contained six words representing each of 
the six value categories on the Allport-Vernon 
test, the relative value preferences for each 
group of words could be inferred from his 
test profile. This indirect method was used 
by Postman, Bruner and McGinnies (4), and 
part of this study repeated their work. 

There was considerable agreement between 
the results of the previous study and those of 
the present one using the second indirect 
method of measuring preferences. Both 
showed a relationship between value prefer- 
ence and speed of recognition. The conclu- 
sions of the PBM study are clouded by the 
use of an equivocal statistical procedure. 
However, the present data were analyzed by 
a more satisfactory statistical technique, so 
that we may accept the conclusions of the 
earlier study. 

The two methods of measuring preferences 
showed similar results. However, the direct 
method yielded a clearer trend with sharper 
discrimination between the extreme value 
categories. 


REFERENCES 
. Fisuer, R. A. Statistical methods for research 
(11th Ed.) New York: Hafner, 1950. 
. Hatcn, G. The role of value and threat in 
orientation. Unpublished doctoral dissertation, 
University of Chicago, 1950. 
. Lewrs, D., & Burke, C. The use and misuse of the 
chi-square test. Psychol. Bull., 1949, 46, 433-489. 
. Postman, L., Bruner, J. S., & McGinnies, E. Per- 
sonal values as selective factors in perception. J. 
Psychol., 1948, 43, 148-154. 
& ScuneimwerR, B. H. Personal values, 
Psychol. Rev., 1951, 


workers 
perceptual 


abnorm. so 

. Postman, L., 
visual recognition, and recall 
58, 271-284. 

. Snyoc, D., & Comes, A. W. 
New York: Harper, 1949. 

. Socomon, R. L., & Howes, D. W. 
personal values, and visual deviation 
Psychol. Rev., 1951, 58, 256-270. 

. THornpixe, E. L., & Lorce, I. The teacher's word 
book of words. New York: Teachers’ 
College, Columbia University, 1944. 


Individual behavior. 


Word frequency, 
thresholds. 


370,000 


Received by Journal of Consulting Psychology, July 19, 
1951. 





AN INVESTIGATION OF THE DIAGNOSTIC VALIDITY OF 
THE SZONDI TEST 
PAUL H. MUSSEN 


Ohio State University 


AND 


SHIRLEY R. KRAUSS 


University of Wisconsin 


EspiTe the fact that there are almost 

no adequate data on the validity of 

the Szondi test, its use as a psycho- 
diagnostic instrument in clinics and hospitals 
appears to be increasing. Deri’s Introduc- 
tion to the Szondi Test (1), the only authori- 
tative work available in English, and the 
publicity given the test have excited wide- 
spread interest in it. 

Few studies designed to test the basic as- 
sumptions or the diagnostic validity of the 
Szondi test have been reported. Rabin (4) 
has shown that students and psychologists 
can identify the diagnosis represented by the 
pictures with significantly better than chance 
results. Guertin (3) found that the distribu- 


tion of picture choices differs significantly 
from what would be expected on the basis of 
chance alone. Fosberg’s (2) four-part study of 
the Szondi has been the most extensive to 
date. He compared the positive and negative 
choices of one hundred normals, fifty paranoid 
schizophrenics, and fifty mixed psychotic 


cases for each factor. Although he found 
that chance is not the only determiner of 
these choices, only the p+ factor distinguished 
the records of normal from abnormal people. 
Contrary to what would have been expected 
on the basis of Szondi’s theory, the s factor, 
showed “no differential frequencies for pre- 
and post-electrically induced grand mal seiz- 
ures,” while the “Szondi sex vector [factors 
A and s] is not sensitive to recent vs. remote 
sex activity” (2, p. 43). He therefore con- 
cludes that “the Szondi in its present form 
should not be used in a clinical situation” (2, 
p. 43). 

These studies have generally been con- 
cerned with limited aspects of the Szondi 
test and their conclusions have been based 
on only one or two administrations of the 
test. For these reasons, they cannot be con- 


sidered adequate tests of the diagnostic valid- 
ity of the Szondi. Deri makes it clear that 
“the administration of the test has to be 
repeated at least six, preferably ten, times, 
to be able to give a valid clinical interpreta- 
tion of the personality” (1, p. 14). 

The present study, which is based upon re- 
peated administrations of the test, was de- 
signed to investigate a number of Deri's 
statements about diagnosis and personality 
empirically. Although there are many warn- 
ings that diagnosis must be concerned with 
“constellations of factors,” “configurational 
pattern of the total test,” or “patterning of 
the total series of test profiles,” Deri makes 
many statements about the clinical and diag- 
nostic meanings of individual factors. We 
wish to examine the validity of these state- 
ments by determining whether or not the 
choice reactions which Deri claims to be 
characteristic of well-adjusted or maladjusted 
individuals are, in fact, typical of these people. 
More specifically, this investigation is con- 
cerned with two major questions: (a) Will 
normal subjects and neurotic subjects differ 
as Deri predicts they will in the types of re- 
sponses made to the Szondi test? (4) Will 
improvement as a result of psychotherapy be 
reflected in changes in choice reactions on 
the Szondi test? 


SuBJECTs AND PRocEDURE 


Our subject population consisted of 40 students 
at the University of Wisconsin. Twenty of these 
students, 13 men and 7 women, were patients at 
the Neurospychiatric section of the University’s 
Student Health Service. All of these students 
were diagnosed as neurotics, and all of them re- 


1 The authors wish to express their appreciation to 
Dr. Annette Washburne and Dr. Thelma Hruza of the 
Department of Student Health of the University of Wis- 
consin for their assistance in obtaining the neurotic 
subjects. 
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ceived psychotherapy. Regardless of the specific 
diagnostic category into which they were finally 
placed, all of the students came to the clinic for 
help voluntarily because they felt maladjusted, 
tense, and unhappy and were experiencing strong 
feelings of conflict. The control group of normal 
students consisted of 20 subjects, 10 men and 10 
women, who did not now feel, and had never felt, 
the need for psychotherapeutic help. 

The Szondi test was administered to each of the 
patients six times (with one- or two-day intervals 
between administrations) when he first came to 
the clinic. Each patient took the test six times 
again after two or three months of treatment. By 
this time, the mean number of therapy sessions com- 
pleted was 11.9, and all patients had had at least 
six therapeutic conferences. All the patients were 
considered to be making some progress in therapy, 
13 of them being rated as “improved” or “much 
improved.” 

The 20 normal control subjects volunteered to 
take the test six times with one- or two-day inter- 
vals between test sessions. 

The plan of the investigation (which is paralleled 
by the plan of the presentation) was as follows. We 
selected statements or series of related statements 
(direct quotations from Deri’s book) which involve 
characteristics of one or both of our subject groups. 
These statements, then, have important diagnostic 
implications. On the basis of each quotation and 
its implications, two specific predictions were made 
with respect to the differences between (a) the 
choice reactions which would be made by the 
normal and neurotic groups and (4) the initial, 
pretherapy responses of the neurotics and their 
responses after two or three months of therapy. 
In each case, the prediction with respect to the 
changes as a result of therapy parallels the predic- 
tion about differences between the normal and 
neurotic groups. That is, it is assumed that the 
reactions of the neurotic group after it has begun 
to improve as a result of therapy should become 
closer to the “normal” types of reaction described 
by Deri. 

These predictions were then checked by determin- 
ing whether or not the Szondi reactions of the 
two groups—and the pretherapy and posttherapy 
reactions of the maladjusted group—can be dis- 
criminated from each other in the way Deri’s 
statements would require. If Deri’s statements are 
valid, our predictions would be confirmed. 

It should be noted that we selected only state- 
ments about factors which had clear-cut meaning 
for our two subject groups (and, consequently, for 
changes as a result of psychotherapy). Thus, if it 
was stated that a particular factor (for example, 
ambivalent p) can be used diagnostically only when 
it is evaluated in terms of its relationship with 
other factors (for example, k+ or kK—), we made 
no predictions involving this factor. If a factor 
(for example, d+) was said to be valuable pri- 
marily for diagnosing psychosis rather than neu- 
rosis, there were no predictions with respect to this 
factor. 
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STATEMENTS AND PREDICTIONS 


Statement 1. Open choices indicate that 
“the corresponding need is not in a state of 
tension” (p. 27).” Since normal people are 
generally less tense, happier people than neu- 
rotics, it may be assumed that, in comparison 
with neurotics, fewer of their needs are in 
states of tension. 

Prediction 1a. Normals will have a greater 
number of open reactions than will the malad- 
justed group. 

Prediction 16. After therapy, neurotics will have 
a greater number of open reactions than they did 
before therapy. 


Statement 2. “The basic ambivalence to- 
ward a given emotional need as indicated by 
the plus-minus [i.c., ambivalent] reaction is 
subjectively experienced as conflict.” “Am- 
bivalent reactions indicate areas where the 
conflicting contradictory tendencies . are 
subjectively experienced” (p. 36). Malad- 
justed people have more subjective feelings 
of conflict than normals. 


Prediction 2a. The neurotic group will have a 
greater number of ambivalent reactions than the 
normal group. 

Prediction 2b. After therapy, the maladjusted 
group will have fewer ambivalent reactions than 
they had formerly. 


Statement 3. “Generally for individuals 
with a constant plus s it is characteristic to 
face and fight reality rather than to withdraw 
into themselves” (p. 74). “Facing and fight- 
ing reality” is more characteristic of well- 


’ 


adjusted people while “withdrawing” seems 
to be a more characteristic response for neu- 
rotics. 

Prediction 3a. Normals will make more s+ 
responses than neurotics. 

Predictions 3b. After psychotherapy, neurotics 
will make more s+ reactions than they did before 
therapy. 

Statement 4. “Minus s constellation means 
tension in the area of aggression but not ac- 
cepted by the person” (p. 75). Repressed 
aggression is often the basis of maladjust- 
ment and neurosis, especially in young adults. 


Prediction ga. Minus ¢ reactions will be more 
frequent among neurotics than among normals. 

Prediction 4b. Minus s reactions will be less 
frequent among neurotics after they have received 
psychotherapy. 


2 All direct quotations are from Deri’s book (1). 
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Statement 5. “Plus-minus s refers to an 
ambivalent way of handling aggression” (p. 
78). Such ambivalence is generally indicative 
of maladjustment. 

Prediction 5a. Ambivalent s will appear more 
often in the records of neurotics than of normals. 

Prediction 56. Ambivalent s will appear more 
often in the pretherapy records of neurotics than 
it will in later records. 


Statement 6. “Open s is an indication for 
continuous discharge in the area of activity or 
aggression. By all means it is an active pic- 
ture. .. . It can be seen in efficient behavior, 


[and] in busy people” (p. 79). Activity and 
efficient behavior are more characteristic of 
normal people than of neurotics. 


Prediction 6a. Open s reactions will be given 
more frequently by normals than by neurotics. 

Prediction 66. Open s reactions will be given 
more frequently by neurotics after therapy than 
before therapy. 


Statement 7. “The minus ¢ constellation 
is obtained from people who are likely to have 
aggressive outbursts.” “The ‘minus e¢ state’ 
is experienced by the subject as strong emo- 
tional tension with no positive mechanisms 
of control.” “Invariably, a constant minus ¢ 
results in general restlessness, and a tendency 
to act out id impulses spontaneously” (p. 93). 
These descriptions of people with minus ¢ 
reactions are more applicable to neurotics 
than to normals. 

Prediction 7a. Neurotics will have more minus 
¢ reactions than normals will. 

Prediction 7b. Neurotics will have fewer minus 
¢ reactions after therapy than they had before it. 


Statement 8. “The plus-minus e¢ constella- 
tion indicates ambivalence in the subject’s 
way of handling aggression. This ambiva- 
lence is experienced subjectively as an emo- 
tional conflict and is likely to lead to periodic 
outbursts” (p. 94). The subjective experienc- 
ing of emotional conflict is more typical of 
maladjusted than of well-adjusted people. 

Prediction 8a. The plus-minus ¢ constellation 
will be found with greater frequency in the malad- 
justed group than in the normal group. 

Prediction 8b. The plus-minus ¢ constellation 
will be found with greater frequency in the neu- 
rotic group before therapy than afterward. 


Statement 9. “The open ¢ constellation in- 
dicates that there is no tension in the area 
of emotional control, which means simply 
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that emotions can be discharged readily” (p. 
95). Lack of tension in the area of emo- 
tional control may be assumed to be more 
common among normals than among neu- 
rotics. 

Prediction 9a. Normals will have more open ¢ 
reactions than neurotics will. 

Prediction 9b. Neurotics will have more open ¢ 
reactions after therapy than they had before 
therapy. 


Statement 10. “Minus Ay is characteristic 
of those individuals who are either unwilling 
or unable to demonstrate their feelings in an 
overtly perceptible way” (p. 103). “Psycho- 
logic characteristics which accompany the 
minus Ay constellation include a vivid phan- 
tasy life, a tendency for daydreaming, and an 
ability for playful, ‘prelogical’ thinking. The 
dynamics of all these traits can be derived 
from the fact that emotions are not acted 
out, but rather are felt as an inner, sub- 
jective experience” (p. 104). “The most com- 
mon indication of anxiety is a strong minus 
hy reaction” (p. 105). Unwillingness or in- 
ability to demonstrate feelings, vivid phantasy 
living, tendency toward daydreaming and 
anxiety are important aspects of maladjust- 
ment and neurosis. 


Prediction 10a. Minus hy reactions will occur 
more frequently in the neurotic group than in the 
well-adjusted group. 

Prediction 10b. Minus Ay reactions will occur 
more frequently in the pretherapy than in the 
later tests of the neurotic group. 


Statement 11. “As in all ambi-equal reac- 
tions, the plus-minus Ay constellation reflects 
subjectively experienced conflict and tension 
in the individual. In this case it reflects con- 
flict in those who cannot resolve to conceal 
or disclose their feelings” (p. 106). Sub- 
jectively experienced conflict and tension 
which is related to the concealment or dis- 
closure of feelings are features of neurotic 
symptomology. 

Prediction ita. Neurotics will have a greater 
number of plus-minus Ay reactions than normals. 

Prediction 11b. Neurotics will have a greater 
nurnber of plus-minus Ay reactions before they have 
psychotherapy than they will after some improve- 
ment has taken place. 


Statement 12. “The plusminus d con- 
stellation indicates strong and subjectively ex- 
perienced tension in the area of object rela- 
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tionships. There is ambivalence in regard to 
looking for more or new objects or to cling- 
ing to the old ones. .. . In spite of their great 
need for intensive attachment and loyalty to 
one person (minus @), they have a tendency 
constantly to get involved in new relation- 
ships which might thwart the continuation 
of relationships they have established” (p. 
127). The subjectively experienced tension 
and the difficulties in object-relationships de- 
scribed tend to be the qualities of untreated 
neurotics rather than of normals or neu- 
rotics who have had psychotherapy. 
Prediction 12a. Plus-minus d constellations will 
occur in the records of the neurotic group more 
frequently than in the records of the well-adjusted 


group. 
Prediction 126. Plus-minus d reactions will be 

more common in the pretherapy records of the 

neurotics than they will be in the later records. 


Statement 13. “The open d reaction indi- 
cates lack of tension in the area of object- 
relationship; in other words, the problem of 
whether to hold on to old objects or to search 
for new is not particularly important to sub- 
jects yielding open d” (p. 129). The lack of 
tension and unconcern about the problem 
mentioned above are more characteristic of 
normals than they are of neurotics. 

Prediction 13a. There will be more open d re- 
actions among the normal than among the malad- 
justed group. 

Prediction 136. There will be more open d in 
the records of the maladjusted group after they have 
had therapy than there was earlier. 


Statement 14. “Plus m generally indicates 
a warm social attitude and is given by sub- 
jects who not only are in need of positive 
emotions from others, but who also are able 
to give love and affection to others” (p. 136). 
“Well-adjusted adults who are able to form 
and maintain satisfactory social relations are 
expected to give plus m” (p. 137). The easy 
expression of love and the ability to form and 
maintain satisfactory social relations are traits 
of normals, but not of neurotics. 


Prediction 14a. Normals will give more plus m 
reactions than neurotics do. 

Prediction 146. Maladjusted persons who have 
had therapy will give more plus m responses than 
they did previous to having therapy. 


Statement 15. “Negative choices in the m 
factor represent a denial of the need to lean 
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on others. They indicate a frustration of the 
early oral needs but, opposing the still opti- 
mistic attitude characteristic of plus m, sub- 
jects yielding the minus m have given up 
hope of compensating for early frustration 
through new social contacts. Instead, there is 
withdrawal and a certain sadness and cold- 
ness in interpersonal relations. Minus m 
subjects are lonely even though they may 
feel a great need for dependence on and 
gratification from external objects. 

“In adults minus m is always a sign of un- 
happiness” (p. 139). These feelings of lone- 
liness, unhappiness, and hopelessness, to- 
gether with coldness in interpersonal rela- 
tions, generally denote the presence of mal- 
adjustment. 


Prediction 15a. Neurotic records will include 
more minus m responses than normal records. 

Prediction 156. The pretherapy records of neu- 
rotics will include more minus m responses than 
the later records. 


Statement 16. Plusminus m “always in- 
dicates an unsatisfactory object-relationship 
with subjectively experienced frustration in 
this sphere.” It also indicates “basically 
undecided sexual orientation. They [people 
giving plus-minus m reactions] were indi- 
viduals who actually experienced their basic 
bisexuality, thus being unable to derive sat- 
isfaction from either hetero- or homosexual 
object cathexes” (p. 143). Unsatisfactory ob- 
ject-relationships, felt frustrations, and unde- 
cided sexual orientation characterize the mal- 
adjusted individual rather than the normal. 


Prediction 16a. Plus-minus m reactions will occur 
more frequently among neurotics than among nor- 
mals. 

Prediction 166. Plus-minus m reactions will 
occur more frequently among maladjusted indi- 
viduals before the beginning of therapy than after- 
wards. 

Statement 17. “One could say that subjects 
with plus p have usually a high capacity for 
symbolizing their needs, by which is meant 
that some aspects at least of their emotional 
needs are actually perceived . . . plus p indi- 
cates that the urging quality of the emotions 
have risen to consciousness and have been 
conceptualized in some way” (p. 171). The 
recognition and conceptualization of real 
needs and emotions are more typical of nor- 
mals than they are of neurotics. 
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Prediction 17a. Normals will have higher plus p 
scores than maladjusted people. 

Prediction 176. After they have had some therapy, 
the maladjusted group will have higher plus p 
scores than they had formerly. 


Statement 18. “The more loaded the minus 
p is, the greater is the tension and urgency 
of the needs demanding to be acied out... . 
They | subjects with minus p reactions] struc- 
ture their environment according to their own 
pattern of needs without, however, being con- 
scious that this is what they are doing .. . the 
need-tensions are acted out without having 
first passed through the system of precon- 
scious, thus, without their becoming linked 
to word-images” (p. 178). Minus p signifies 
a “lack of understanding of the full psycho- 
logic implications of our own actions, the 
lack of recognizing the connection between 
everyday activities and the underlying, deeper 
emotional needs” (p. 179). Although nor- 
mals often show this lack of understanding 
of their own needs and actions, it is more 
characteristically an important feature of the 
personality of the maladjusted. 


Prediction 18a. Neurotics will have a greater 
number of minus p reactions than normals have. 

Prediction 186. Neurotics will have a greater 
number of minus p reactions before they begin 
therapy than they will after some therapeutic 
progress has been made. 


Statement 19. “Subjects with plus k are 
likely to have good insight into emotional 
processes; they are willing to face their own 
emotions” (p. 194). Insight of this kind 
and willingness to face emotions are non- 
neurotic traits. 


Prediction 19a. There will be a greater frequency 
of plus & reactions in the normal than in the 
maladjusted group. 

Prediction 19b. There will be a greater frequency 
of plus & reactions in the maladjusted group after 
therapy has begun than there was earlier. 


Statement 20. “The minus & indicates that 

. the only way to assure at least a rela- 
tive harmony within the ego is to repress the 
forbidden impulses” (p. 198). “Minus & sub- 
jects do not dare to live out their individualis- 
tic needs openly, nor do they have the need 
to face what those needs really are. Standards 
and value judgments are readily accepted 
from the outside, with little questioning of 
their origin and validity” (p. 201). Repres- 
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sion, attempts to exclude parts of himself, lack 
of courage in living out his needs, and the 
unquestioning acceptance of popular stand- 
ards and values generally denote a malad- 
justed person. 


Prediction 20a, Neurotics will have higher minus 
k scores than normals. 

Prediction 20b. Maladjusted people will have 
higher minus & scores before they have had therapy 
than they will afterwards. 


Statement 21. “A plus-minus & is the typi- 
cal reaction of those subjects for whom 
establishment of emotional independence con- 
stitutes a central problem . . . in individuals 
with plus-minus &, the need to be independ- 
ent, rational, and detached in emotional mat- 
ters is experienced consciously” (p. 203). The 
reaction also indicates “subjective feelings 
of anxiety” (p. 203). Emoticnal detachment, 
together with feelings of tension, conflict, and 
anxiety, especially in regard to the establish- 
ment of emotional independence, may be con- 
sidered symptoms of maladjustment. 

Prediction 21a. The maladjusted group will give 
more ambivalent & reactions than the normal g-oup. 

Prediction 216. Before therapy, the maladjusted 
group will have more ambivalent & reactions than 
it will after therapy. 

Statement 22. Open & reactions refer to the 
subjects’ “acceptance of themselves as they 
are, with all their needs and expectations 
of environmental objects to facilitate without 
difficulty the gratification of these needs” (p. 
207). Self-acceptance and the assurance that 
needs will be gratified are typically traits of 
normals rather than of neurotics. 


Prediction 22a. Normals will have a higher fre- 
quency of open & reactions than will neurotics. 

Prediction 22b. Neurotics will have a higher fre- 
quency of open & reactions after they have had 
therapy than they had previously. 


REsuLTs 


In order to test the 22 pairs of predictions 
derived from Deri’s statements, the frequency 
of occurrence of each reaction choice was 
considered the individual’s score in that fac- 
tor. The differences between the mean scores 
of the neurotic and normal group on each 
factor were then determined to check the a 
predictions, while the 4 predictions were 
checked by comparing the neurotic group’s 
pretherapy and during-therapy mean scores 
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TABLE 1 


Comparisons OF Neurotic AND NorMat Groups on Szonpi Factor 
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open & 
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-50 
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-70 
-I0 
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-40 
-05 
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for the 22 factors. In all cases, t-tests were 
used to determine the significance of the dif- 
ferences. The results of these comparisons 
are summarized in Tables 1 and 2. 

These tables show that our data confirm 
only one of these predictions. As had been 


predicted, ambivalent ¢ reactions occur with 
significantly greater frequency in the neu- 
rotic than in the normal group. Actually, 
this finding cannot be accepted as strong sup- 
port for the test, since, out of 44 ¢-tests per- 
formed, at least two differences significant at 
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the 5 per cent level could be expected on the 
basis of chance alone. 

It should be noted that the difference be- 
tween normals and neurotics in the m+ fac- 
tor approached significance, neurotics hav- 
ing a greater frequency of this reaction type 
than normals. This finding is particularly 
interesting, in view of Deri’s statement that 
“well-adjusted adults who are able to form 
and maintain satisfactory social relations, are 
expected to give plus m.” 

Table 2 makes it clear that the pretherapy 
Szondi reactions of neurotics cannot be dis- 
tinguished from their reactions to the test 
after they have made therapeutic progress. 
In short, as far as we can ascertain from our 
data, the Szondi test does not reflect clin- 
ically observed changes which are the results 
of psychotherapy. 

As a result of the compiete failure of the 
test to make the diagnostic discriminations 
its proponents claim it can make, the writers 
are in agreement with Fosberg’s statement 
that “the claims for this technique must be 
looked upon with great skepticism” (2, p. 
43). The findings indicate that Deri’s state- 
ments about Szondi test interpretations have 
no diagnostic significance. Therefore, we 
must conclude that reactions to the test, if 
interpreted according to Deri’s formulations, 
have no clinical meaning. Until there is 
some strong evidence indicating that under 
certain conditions clinically valid diagnoses 
can be made on the basis of the Szondi test, 
its general use as a psychodiagnostic tech- 
nique should be discouraged. 


SUMMARY AND CONCLUSIONS 


On the basis of specific interpretive state- 
ments taken from Deri’s Introduction to the 
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Szondi Test, certain predictions about how 
normals and neurotics would differ in their 
reactions to the Szondi test were derived. 
Parallel predictions were made about how 
changes as a result of therapy would be re- 
flected in changes in test reactions. 

The test was administered six times to 20 
normal subjects and 20 neurotic patients at 
a student clinic. The neurotics were retested 
two or three months later, after they had 
made some therapeutic progress. 

In general, the data do not support Deri’s 
statements about the interpretive significance 
of the Szondi test factors. According to this 
evidence, the test does not distinguish be- 
tween maladjusted and normal people, except 
in the ambivalent e factor. This one sig- 
nificant difference might easily be accounted 
for on the basis of chance alone. The pre- 
therapy reactions of the neurotic group could 
not be discriminated from their reactions 
after they had two or three months of therapy 
and were improving. 

In view of earlier findings reported by Fos- 
berg and these results, it is felt that diagnoses 
of the nature or extent of maladjustment on 
the basis of Szondi tests may be misleading 
and invalid. 
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SIMILARITY OF VALUINGS AS A FACTOR IN SELECTION OF 
PEERS AND NEAR-AUTHORITY FIGURES 
JOSEPH A. PRECKER 
Bard College 


tT the present time considerable interest 
is being manifested in the processes of 
group-tormation and interpersonal re- 
lationships, for practical as well as theoretical 
purposes. The present study is an attempt to 
gain greater understanding of the social proc- 
ess through an investigation of some aspects 
of selection of associates. 

The underlying factor operating in “choice” 
of associates to be postulated here is similar- 
ity of values (14, 15, 16) in the area(s) of 
living called upon by the nature of the social 
interaction. Values (or valuings), as used in 
this context, are not viewed as “residing” in 
the object nor in the self, but are relational 
in quality, linking the object and the self 
in some meaningful relationship (2, 6). Nor 
can valuing be separated from the act or po- 
tential act--it is behavioral in function. Val- 
uings in a given organism may be conflicting, 


meaningful primarily in terms of the situa- 
tions in which they operate, and relatively 
slow to change. They may be provisionally 
defined as systems of probable behavior oc- 
curring under stated conditions involving an 
overt or covert approach-indifference-with- 
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drawal dimension. All behavior can be 
viewed as involving an evaluational element 
—that is, can be investigated as manifesta- 
tions of the valuing process (18). Valuings 
operate in the selection of associates, it is 
here suggested, since they allow for a uni- 
verse of discourse, an operational “language,” 
which facilitates intercommunication and, 
thereby, interaction. 

The setting. In order to comprehend the 
specific form of the hypotheses tested in this 
study, it is necessary to have some idea of the 
context in which it was carried out. There- 
fore the description of the college setting is 
presented before the hypotheses. The college 
is a small coeducational liberal arts college of 
a “progressive” nature, located about 100 
miles from a major city and about five miles 
from the nearest town. The college is fre- 
quently referred to as “the community” and 
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there are relaxed relationships between and 
amongst students, faculty, and administration 
unusual in most educational institutions. All 
classes meet in small seminars, and each 
student has an individual tutorial held with 
a specialist (the adviser) in the, area of the 
student’s major interest. The adviser, in 
addition to his teaching responsibilities, 
assists the student with many matters that 
usually do not enter into student-teacher 
relationships. Even though the relationship 
between the adviser and the~advisee is often 
friendly and persists beyond the conference 
room, the fact that the adviser has more than 
a passive hand in leading the tutorial, and 
also is responsible for evaluating the student’s 
work—placing him in the role of a “judge”— 
gives the adviser some of the characteristics 
of an authority-figure. 

At the time of this study a student-faculty 
committee was attempting to set up criteria 
to be utilized in educational evaluation. The 
present author and two other social scientists 
were called upon to construct and administer 
a questionnaire designed to gather informa- 
tion from the total community concerning 
the criteria to be used in evaluation. This 
value material, in addition to sociometric 
material that could be gathered at the same 
time, led the present author to feel that the 
hypotheses listed below could be investigated. 

Hypotheses: 1. Students tend to select 
associates with similar values in an important 
area of their functioning. 

(a) Greatest similarity of values tends to 
occur when reciprocal choices are made. 

(4) Seniors tend to choose associates whose 
values are more similar to their own than do 
freshmen. 

2. Students tend to choose advisers (in a 
free-choice situation) whose values (as re- 
flected in educational criteria for evaluation) 
resemble their own. 

(a) Greatest similarity of values tends to 
occur when real adviser and adviser-choice 
are the same person. 
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(4) Seniors tend to choose advisers whose 
valuings are more similar to their own than 
do freshmen. 

Subjects. Subjects comprised the student 
body (N==242) and faculty (N=42) of 
Bard College. All but two students and one 
faculty member cooperated in this venture. 

Procedures. In an open-ended question- 
naire the subjects were asked to describe 
those characteristics which they felt to be 
important as criteria for evaluation (utilizing 
a form of the critical incident technique). 
Considering the nature of the setting, the 
variety of responses was greater than would 
be expected in a traditional educational insti- 
tution. These free responses (amounting to 
1325 criteria) were then transferred to indi- 
vidual McBee cards, and categorized by three 
independent judges, who then compared the 
categorizations, and agreed on 39 individual 
categories. The subjects were then called 
upon to rank the resulting 39 criteria accord- 
ing to their conception of the importance of 
the criteria. 

The following near-sociometric questions 
were asked of the students: 

1. “If it were possible for you to keep in 
touch with only three students here now 
after you leave college, which three would 
you choose?” (9; 10). 

2. “If you were able to choose as your 
adviser anyone teaching here this semester, 
irrespective of subject matter, and whether 
or not you have had any direct contact with 
him or her before, whom would you choose?” 

No information was given concerning the 
aim of this study as far as relating the criteria 
for educational evaluation and sociometric 
choice. Two weeks later a test-retest relia- 
bility run was administered to 47 students, 
in which the ranking of the 39 criteria and 
the sociometric choices were repeated. 

The criteria utilized had implicit meaning 
in the situation since they were of direct and 
personal concern to all of the subjects investi- 
gated, since the subjects were aware of the 
probability that the highly ranked criteria 
would be used in the future for the evalua- 
tion of the students themselves. These 
ranked criteria constituted the valuings used 
in this study for comparison of mutual socio- 
metric choices with unreciprocated choices 
and with individuals selected at random who 
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had not chosen each other. In addition, a 
measure of homogeneity was employed for 
the total group, utilizing Kendall’s coefficient 
of concordance (5), against which all meas- 
ures of relationship can be compared. 

Statistical treatment. The Spearman for- 
mula for rank-order correlation (3) was used, 
the 39 criteria constituting the samples cor- 
related for each pair of individuals. Since 
the sampling distribution of R (rank-order 
correlation) approaches the normal form as 
N increases, and since N in the present in- 
vestigation is as large as 39, we can refer 
t==R\/ N—1 to the normal probability table 
as a test of significance for R. With an N 
of 39, an R must be greater than .318 to be 
significantly different from zero at the .05 
level; .419 at the .o1 level; and .534 at the 
oor level. 

However, in terms of the hypotheses here 
set forth for testing, it is not sufficient to test 
the significance of the difference from zero 
of a single R, but it is also necessary to test 
the significance of the difference between two 
means, each consisting of many R’s. In order 
to do this, a procedure utilized in two studies 
by Lorge (7, 8) was followed. The proce- 
dure utilized in the present study required 
that any R be treated as an estimate of r, and 
be transformed to Fisher’s z. The mean R’s 
were arrived at by means of the transforma- 
tion to z, averaging of the 2’s, and then 
reconverting to r. This estimate is often 
slightly different than would be arrived at 
by estimating the mean merely by averaging 
of the r’s, since the distribution of r’s is not 
normal, particularly if there is a correlation 
other than zero. 

To test the significance of the difference 
between two 2’s, the actual difference be- 
tween the 2’s is divided by the standard error 
of the difference and the result referred to 
the normal probability table. The formula 
for the standard error of the difference be- 


a I 
tween zs is ¢ = a 
21-22 \ 2 —3 + 





—*__ where 
; R3—3 

n is the number of pairs of items utilized 
in arriving at a given r. However, in a situ- 
ation such as the present, in which it is nec- 
essary to compare the means of two sets of 


z’s, this formula is modified. As Edwards 
says: “In some research problems it may be 
desired to test the significance of the differ- 
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ence between two such average values of r 
obtained by the methods above. This can be 
accomplished in the manner described earlier 
by finding the standard error of the difference 
between the corresponding values of z. The 
values of m, and ng in the formula [above] 
should be taken as the corresponding values 
of 3(n—3)+3,” (3, p. 136), or, 


I 4 I 

is VN.G@,—3) 43 N2(m2—3) +3 
where n is the number of pairs of items 
utilized in arriving at a given r, and N is 
the number of r’s, transformed to 2’s, utilized 
in arriving at the mean, in the special case 
where the n’s are the same for each of the 
r’s. In this way, the mean R’s were com- 
pared for the various groupings. R in the 
following pages should be read as “mean 
rank-order coefficient of correlation.” 

Since a high degree of homogeneity in the 
ranking of values was expected, it was neces- 
sary to utilize some measure of the degree 
of agreement in the sample as a whole, 
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against which the appropriate R’s could be 
compared. In the present study the index 
of homogeneity used was Kendall's coefh- 
cient of concordance (5)." 

In the present investigation, if the coefh- 
cient of concordance does not indicate signifi- 
cant agreement amongst the rankings of the 
entire sample, we can assume that the group 
is relatively heterogenous in regard to the 
characteristics being investigated (values as 
reflected in criteria for evaluation). How- 
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formula is W = ———— 
m2N(N?—1) 
sum of the squares of the deviations of the m sums of 
rank around their mean, m is the number of rankings, 
and N is the number of items ranked. W, the co- 
efficient of concordance, may have values ranging from 
© to 1 but it cannot be negative. R, the mean rank 
order coefficient among all possible pairings, is related 

mW —1 


1 The where S is the 


to W by the formula R = — 

m—t! 
fectly with the mean of m—1 R’s arrived at by com- 
puting the individual R’s for all possible pairings and 
dividing by the number of R’s. 

In to test the significance of the agreement 
amongst the various rankings, Kendall has indicated the 
applicability of the F test, taking the particular form 

(m—1)W 2 
F=- - with degrees of freedom m—N—1——, 
—W m 
= (m—1)(N—1—2 ). 


This R agrees per- 
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and ms 
m 


PRECKER 


ever, if significant agreement is found, we 
must assume some degree of homogeneity. 
In addition, if there is no significant agree- 
ment in the group as a whole, but significant 
agreement in line with the hypotheses here 
being tested (in terms of association and 
selection of peers), then we can assume that 
our hypotheses are upheld. But if signifi- 
cant agreement is found in the group as a 
whole, and significant agreement is found 
in the pairings of associates in line with the 
hypotheses, it becomes necessary to compare 
in some way the extent of agreement in the 
respective groupings before the hypotheses 
can be confirmed. This, in fact, was the 
problem confronted in this study, leading to 
the adoption of the technique described 
above. Now that we have anticipated the out- 
comes, we are ready to present the results. 

Reliability. In the test-retest reliability 
run two weeks after the initial ranking, 47 
subjects ranked the 39 criteria once again. 
Of the 47 records, 45 were usable. The mean 
R of the 45 individuals, compared on their 
first and second rankings, was .69, the R’s 
ranging from .28 to 87. Since the distribu- 
tion was positively skewed, the median R is 
a more meaningful datum, being .75 in this 
case. Of the 45 R’s computed, 42 were sig- 
nificantly different from zero at the .or level. 
Of the 45 subjects in the reliability run who 
chose an adviser two weeks after the initial 
run, only 4, or g per cent, changed their 
choice. 

Agreement in total group. Computing 
the R from the coefficient of concordance for 
the total student group, taken in all possible 
pairings, we arrive at a mean R of .27, which 
is significantly different from zero at the .o1 
level (F=85.86; m,—=38, m.—=8,474). The 
R indicates that the group is significantly 
homogenous in regard to their ranking of 
the 39 criteria or values. It is therefore nec- 
essary to take this agreement into account in 
making any comparisons or claim of agree- 
ment between pairs of associates; in other 
words, the agreement must be greater before 
any additional factor can be said to be 
operating. 

Student ranking of criteria compared with 
ranking by peer-choice. The hypotheses 
under consideration here demanded the com- 
parison of mutual-choice pairings, one-way 
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pairings, and no-choice pairings. Further 
breakdown into groupings, such as freshmen 
and seniors, was also necessary. Table 1 
presents the R’s, arrived at by means of the z 
transformation, of the three possible choice 
relationships. For the no-choice group, 31 
pairs of students who had not selected each 
other were chosen at random. The signifi- 
cance of the difference between R’s is also 
tested. 


TABLE 1 
SIGNIFICANCE OF THE DIFFERENCE BETWEEN R’s WHEN 


R’s ror Mutuat-Cuoice, One-Way Cuoice, 
AND No-Cuolce ARE COMPARED 








Group NumsperorR’s R tft 





Student with peer, mutual choice 59 393 
1.48 
Student with peer, one-way choice 
Student with peer, mutual choice 
Student with peer, no choice 


Student with peer, one-way choice 


Student with peer, no choice 





* Significant at the .02 level of confidence. 


In Table 2 the various pairings have been 
compared with the agreement in the total 
group, arrived at through Kendall's coefh- 
cient of concordance. 


TABLE 2 


SIGNIFICANCE OF THE DIFFERENCE BETWEEN R’s WHEN 
R’s ror Mutuat-Cuoice, One-Way CHOoIce, AND 
No-Cuoice Parrincs ARE COMPARED WITH 
AGREEMENT IN THE ToTtaL Group 





Group NuMBER OF R’s 





Student with peer, mutual choice 59 
Agreement in total student group 24976 
Student with peer, one-way choice 73 
Agreement in total student group 24976 
Student with peer, no choice 31 


Agreement in total student group 24976 





* Significant beyond the .001 level of confidence. 


A further comparison was made in which 
freshmen and seniors were treated as separate 
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groups. These results are presented in 
Table 3. 


TABLE 3 


CoMPARISON FoR R’s WHEN R’s For Mutuat-Cuolce, 
One-Way Cuotce, anp No-Cuoice Pairs 
ARE COMPARED FOR FRESHMAN 
AND SENIOR GROUPS 








Group CLass NuMBER OF R’s 





Student with peer, 
mutual choice Freshmen 
Seniors 


Student with peer, 
one-way choice Freshmen 

Seniors 

Student with peer, 
no choice Freshmen 


Seniors 





By examining Table 1, we can investigate 
the hypotheses related to the choice of asso- 
ciates. In line with the expectations implied 
in the hypotheses, we find that the R (.39) 
is highest for pairs making reciprocal choices, 
lower (R=.36) in pairs in which one student 
has chosen another but has not been chosen 
in return, and lowest (R=.31) for no-choice 
pairs chosen at random. By testing the sig- 
nificance of the difference between R’s, we 
find that the only significant difference (at 
the .02 level) is the comparison of the R for 
students and their peer-choices (when choice 
has been reciprocated) and the R of student 
no-choice pairs (students who have not 
chosen each other) selected at random. Since 
the R of the 31 pairs of no-choice students is 
rather high when compared with the R for 
the total student group arrived at through 
Kendall’s coefficient of concordance, even 
though the R (.31) of the no-choice pairs is 
not significantly different from the R (.27) 
for the group as a whole, it was deemed 
advisable to compare the R’s for the various 
choice-pairs with the R indicating general 
agreement in the group as a whole to get a 
more clear-cut picture of the situation. These 
results are reported in Table 2, where we 
find significant differences considerably be- 
yond the .oor level, when we compare the 
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R for the mutual-choice pairs, and the R for 
the one-way choice pairs with the R for the 
group as a whole. The direction seems most 
obvious in the mutual-choice pairs. 

In order to test the possibility of a grow- 
ing tendency with age, or with increased 
proximity, to choose associates with similar 
values, freshmen and seniors were compared 
in the various groupings. The data are pre- 
sented in Table 3. Owing to the small num- 
ber of cases available for this comparison, 
conclusions are tentative, but the small differ- 
ences revealed are well within the range 
admissible through sampling error and the 
like. We tentatively conclude that this tend- 
ency does not increase with age, nor with 
increased proximity. (Since this study was 
conducted in the spring semester, most of 
the freshmen had the opportunity of approxi- 
mately six months of contact before making 
their choices.) 

In summary, we find the major hypotheses 
upheld. This states that students will tend 
to select associates in the peer group with 
similar values. The subhypothesis which 
states that greater similarity of values will 
tend to occur when the reciprocal choices are 
made is upheld most clearly. 

Student ranking of criteria compared with 
adviser-choice’s ranking of criteria. In order 
to test the hypotheses under consideration 
here, it was necessary to compare the situ- 
ations in which (a) the and the 
adviser-choice are the same person; (4) the 
adviser and the adviser-choice are different. 


adviser 


In the latter case, it was also necessary to 
study the relationship between the student 
and the real adviser. The results of these 
comparisons may be seen in Table 4. 

A further comparison was made, in which 
freshmen and seniors were treated as separate 
groups, in order to study possible changes 
with time or propinquity. The results of 
this comparison may be seen in Table 5. 

If we examine Table 4 we find that the 
R (28) students and their real 
adviser is lowest when the student’s adviser- 
choice does not coincide with his real adviser. 


between 


The R (.24) is higher in the situation in 
which the student’s ranking of criteria is 
compared with the ranking by their adviser- 
choice when the adviser-choice is not his real 
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TABLE 4 


SIGNIFICANCE OF THE DIFFERENCE BETWEEN R’st 


WHEN R ror Srupent—Reat Apviser, R For 
SrupenT—Apviser-Cuoice (ADVISER AND 
Apviser-Cuoice DIFFERENT) AND R FoR 
SrupENT—ApvisER-CHOICE (ADVISER 
AND ADVISER-CHOICE THE SAME) 

ARE COMPUTED 








Group NumsBerorkR’s R ¢ 





Student with adviser-choice, 
adviser-choice and adviser 
the same 


Student with adviser-choice, 
adviser-choice and adviser 
different 


Student with adviser-choice, 
adviser-choice and adviser 
different 


Student with real adviser, 
adviser and adviser-choice 
different 


Student with adviser-choice, 
adviser-choice and adviser 
the same 


Student with real adviser, 
adviser-choice and real 
adviser different 44 





* Significant at the .01 level of confidence 

tR should be read as “mean rank-order coefficient of 
correlation,” in this case, of the ranking of the 39 value- 
criteria of students and faculty in appropriate categories. 


TABLE 5 


acai 
SIGNIFICANCE OF THE DIFFERENCE BETWEEN R’s WHEN 
R ror StupENT-REAL Apviser, R ror STUDENT- 
Apviser-Cuoice (ApvIsER AND ADVISER- 
Cuoice THE SAME), AND R ror STUDENT—- 
Apviser-Cuoice (ADVISER AND ADVISER- 
Cuotce DirFer—ENT), FOR SENIOR AND 
FRESHMAN Group ARE COMPARED 


Group 





Freshmen 
Seniors 


Student with adviser, 
adviser-choice and 
real adviser the 
same 

Freshmen 

Seniors 


Student with adviser- 
choice, adviser- 
choice and adviser 
different 

Freshmen 

Seniors 


Student with real 
adviser, adviser- 
choice and adviser 
different 





* Significant at the .02 level of confidence. 
+ Significant at the .01 level of confidence. 
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adviser, and highest (.38) when the students’ 
ranking of criteria is compared with their 
adviser-choice’s when the adviser and the 
adviser-choice are the same person. How- 
ever, testing the significance of the difference 
between these K’s we find that the only 
clearly significant difference (at the .o1 level) 
is the difference between the R of .28 for 
students’ ranking of criteria compared to the 
ranking by the real adviser, when the real 
adviser and the adviser-choice are different, 
and the R of .38 for student ranking com- 
pared to the ranking of criteria by the real 
adviser when the real adviser and the adviser- 
choice are the same. 

Here we find support for the major 
hypothesis. 

Comparing freshmen and seniors in ad- 
viser-choice relationship. Comparing seniors 
and freshmen in Table 5, we find that seniors 
tend to demonstrate greater similarity to 
their real advisers than do freshmen. Test- 
ing the significance of the differences, we 
find that the R of .42 for seniors and advisers 
is different from the R of .32 for freshmen 
and advisers, significant at the .o2 level. We 
find no significant difference between fresh- 
men and seniors in their relationship with 
adviser-choice, but the difference between the 
R of .26 for seniors and real advisers, and the 
R of .22 for freshmen and real advisers, is 
significant at the .or level of confidence. The 
meaning of these tendencies is that seniors 
tend to have educational values more similar 
to those of their real adviser than do fresh- 
men, regardless of whether or not they select 
the adviser as their preferred adviser-choice. 
This is not enigmatic, even though it is not 
possible to account for the extent of the con- 
tribution made by the following factors: (a) 
Seniors have had more opportunity to ob- 
serve their advisers than have freshmen. (4) 
Seniors have had at least some opportunity 
to choose their adviser which is usually not 
the case with freshmen. (c) Seniors are more 
likely to have an adviser with major interests 
more like their own, since they are doing a 
senior project in which they are usually con- 
siderably involved, and which is often in the 
area of the adviser’s interest, since it is his 
responsibility to guide the project closely, as 
well as to give aid in the formulation of the 


problem. (d) Since many seniors have been 
with their adviser for several semesters, their 
values may have undergone change in the 
direction of greater similarity to their ad- 
visers’. 

Discussion. Before discussing the general 
implications of the study, it is advisable to in- 
dicate the shortcomings of the present ap- 
proach: 

1. The difficulties of the measuring device: 
Although ranking seemed appropriate in the 
setting, and is a meaningful task in a choice 
situation, it nevertheless presents certain in- 
surmountable difficulties, not being an instru- 
ment with equal step-intervals. In addition, 
in a situation in which there are 39 criteria, 
the discrimination becomes difficult, and vari- 
ability of response is likely to occur. This in- 
creased variability may result in lowering the 
reliability, as well as diminishing of the re- 
lationship between two sets of rankings (4). 

2. Lack of statistical techniques for the 
comparison of rank-order coefficients of cor- 
relation. Since mathematical _ statisticians 
have not yet devised techniques for the com- 
parison of Rs, it was necessary to use tech- 
niques which only approximate the desired 
results; that is, techniques applicable to Pear- 
son’s r, utilizing the z transformation func- 
tion of Fisher, in which the assumption was 
made that R is an approximation of 7. In the 
light of this, even though the results are in 
the direction of the major hypothesis, they 
are dependent upon these assumptions. 

3. Lack of direct personal pertinence of 
many criteria for the respondent. Even 
though an attempt was made to utilize the 
characteristics suggested by the group as a 
whole, rather than selecting characteristics 
which appeared suitable to the investigator, 
each member of the group did not mention 
all of the 29 characteristics, nor even most 
of them. Many of the criteria may not have 
been of central concern to the individual, al- 
though he may have recognized their impor- 
tance in some hierarchy of criteria. Perhaps 
the most meaningful approach to pairings 
would have been some way of utilizing the 
original “free” responses of the individuals 
obtained in the initial step in this study, but 
this would have led to a variety of difficul- 
ties in the statistical treatment of the data. 
However, the greater the pertinence and 
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“realness” of the values to the individuals 
concerned, the greater the meaningfulness of 
the results of the investigation. 

4. No-choice does not mean rejection. In 
order to investigate the present set of hypoth- 
eses most completely it is necessary to study 
rejection patterns as well as acceptance pat- 
terns. This was not undertaken in the pres- 
ent investigation. Since the number of 
choices was limited, it is likely that other 
choices would have been made if additional 
choices were possible. In addition, choices 
may have been forced, owing to the nature 
of the question or “social pressure,” and a 
person caring to make but one choice may 
have felt it necessary to make all three, pos- 
sibly watering down the factor under con- 
sideration. In a community in which there 
is a certain minimal agreement of values, the 
possibility of choices may be quite extensive. 
The comparison of choice pairings with no- 
choice pairings might have been more dra- 
matic, and more revealing of the processes 
postulated in this study if no-choice meant 
rejection rather than possible acceptance, and 
choice was not limited by the arbitrary 
boundaries of limitation to, or expectation of, 


three peer choices and one adviser choice. 


MacLeod has written that 

. the experience of collective responsibility can 
extend itself only within whatever limits are indi- 
cated by the term “we,” and “we” can only refer 
to myself and other persons who are apprehended 
as other selves. . . . Within the category of psycho- 
logically apprended persons there is a range which 
extends from persons who are relatively neutral or 
indifferent to us to persons who are highly charged 
with selfhood . and these phenomenal 
differences betoken crucially important determi- 
nants of our behavior with reference to other per- 
sons, and ultimately, of the way in which the social 
world becomes organized for us (11, p. 195). 


The problems investigated in this study are 
related to the degiee of “selfhood” manifested 
in object-choice. It is not the “accidental” 
factors of association that have been sought 
in this study, but rather the organizing 
(though not necessarily conscious) factors 
which stem from (a) the individual’s needs 
and values; (6) the meaning assigned to ex- 
perience; and (c) the degree of “selfhood” 
manifested in object-choice. 

Individuals tend to select associates whose 
valuings are similar to their own in the con- 
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text in which they function. The automor- 
phic process (17)—the tendency to understand 
primarily in terms of our own experience— 
makes the choice of associates more likely 
to occur from those in whom there is 
this similarity of experience. Similarity of 
values allows for increased interaction, since 
values, conceived of as behavioral and rela- 
tional (2, 6), allows for similarity of action 
and thereby increases the possibility of 
mutual understanding: similarity of values 
allows for a mutual language, and therefore 
increases the probabilities of (a) intercom- 
munication, (4) effective understanding, and 
(¢) joint action which is mutually tension- 
reductive. Similarity of values also operates 
in line with the defenses of the self—if values 
are rejected, then the self is rejected. By the 
same token, if our values are accepted, the 
self is accepted, making a foothold whereby 
interaction may begin on an effective basis, 
and whereby the possiblities of value develop- 
ment, or value change, in noncrucial areas of 
the self, are increased. The processes of iden- 
tification and projection also come into play. 
Identification is more likely to take place 
when the object possesses characteristics 
which are highly valued by the subject, as 
well as manifests a minimal similarity of val- 
ues. With this basic similarity, the subject 
is likely to fill in the gaps by attributing to 
the object-choice values in other areas similar 
to those of the subject. That this is the case 
has been demonstrated in another study by 
the present author (13). 

By going beyond the data in the present 
study, assuming that individuals tend to select 
peers with similar values, as well as authority- 
figures, leaders, and hero or ego ideals, we 
can tentatively suggest certain of the factors 
operative in effective leadership. The effective 
(chosen) leader must manifest or suggest 
similarity of values in the crucial areas of 
functioning in which this group is operating, 
as well as in concentric value-areas—or else 
remain sufficiently ambiguous in these other 
areas to allow the process of projection to 
fill in. Since we know that the process of 
communication is of vital importance in so- 
cial interaction, and that the more efficient 
the communication process the more effec- 
tive the nature of the organized behavior can 
be, the similarity of values effectively cuts the 
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communication process, since there are al- 
ready operative similar tendencies to behave, 
and only reduced cues are necessary to set 
this action in motion. Also, providing that 
the situation calls upon the area(s) of similar 
valuing, the leader and the led (or the group 
operating with similarity of values) enter into 
joint action which is mutually tension-reduc- 
tive, which in turn reinforces the social bond 
holding the group, or the leader and the led, 
together in their relationship. 

Since the similarity of values may also be 
psychologically equated with the acceptance 
of others, this, too, in turn reinforces the so- 
cial bond. In manifesting similar values, the 
leader is not only saying that “What is the 
Good (the True and the Beautiful) for you 
is also the Good for me,” but also, “You 
are ‘Good’ for professing (or moving to- 
wards) that which obviously is the Good, the 
True, the Beautiful.” This verbal by-play is 
never directly expressed, but the implication 
in behavior amounts to the same thing. But 
the power in such a situation is not merely 
similarity of present values, but the inroads 
gained in this way lead to the increased sus- 
ceptibility to value-change, in the direction 
of the leader, or authority-figure, or friend. 

Although the ramifications of the present 
point of view can be extended considerably, 
and their pertinence exemplified in many ex- 
amples of various kinds of leadership and 
groupings, suggested hypotheses for further 
research will suffice for the present: 

1. Individuals tend to rate others more fa- 
vorably (in peer group, teacher-student rela- 
tionship, employer-employee _ relationship, 
leader-follower relationship, etc.) when val- 
ues in pertinent areas are similar. 

2. Productivity in pairs or in groups will 
tend to be higher when values are similar. 

3. Rejection patterns will tend to be most 
extreme when (a) the rejectee holds values 
which are diametrically opposed to*those of 
the rejector, or when (4) the rejectee holds 
values which are similar to those of the re- 
jector in certain areas of behavior, and values 
which are markedly dissimilar in other areas 
of behavior, both however capable of being 
called up by the situation in which inter- 
action is possible. 

4. Groups tend toward greater complexity 
and integration of functioning to the extent 


to which the group members demonstrate 
(or develop) similarity of values pertinent to 
the situation. 

5. Attitude change will take place most ef- 
fectively in social situations in which the in- 
dividual interacts meaningfully with some- 
one (or ones) who has values similar to the 
individual’s in at least one major area of the 
individual’s functioning. 

Summary. The major hypothesis that indi- 
viduals tend to select associates (peers and 
near-authority figures) with similar values 
was investigated. Values are here considered 
relational and behavioral, providing an opera- 
tional “language” which facilitates intercom- 
munication and interaction. The subjects in 
the present study consisted of a small pro- 
gressive college community. Free responses 
to an open-ended questionnaire were cate- 
gorized, and the resulting 39 value-criteria 
were ranked by each member of the com- 
munity. In addition, near-sociometric mate- 
rials were gathered. Subjects tended to 
choose peers and near-authority figures whose 
values resembled their own, thereby confirm- 
ing the hypothesis. This process was most 
marked when reciprocal choices were made. 

Some of the implications of this study were 
generalized to problems of group structure 
and action, and to situations involving lead- 
ership, indicating that values which are sim- 
ilar lead to more effective intercommunica- 
tion and action, as well as strengthen the 
social bond through bolstering the defenses 
of the self. Further implications regarding 
attitude change and other problems were 
suggested in several hypotheses for future 
investigation. 

Although the present investigation was 
specifically concerned with pairings (the 
dyad [1] if mutual interaction is present), 
the pair is the microcosm in which many of 
the aspects of the macrocosm of larger group- 
ings may be operative. 
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THE CRITERION IN COUNSELING: A GROUP 
PARTICIPATION SCALE * 


HAROLD B. PEPINSKY,? LAURENCE SIEGEL, ann ELLIS L. VANATTA 
The State College of Washington 


HE report of a recent conference on the 
“training of psychological counselors” 
(8) implies that the central task of psy- 
chological counseling is to help individuals 
with problems arising out of their interper- 
sonal relations. If this is an acceptable em- 
phasis, it would seem wise to search for 
relevant criterion measures. In our studies of 
counseling outcomes at the State College of 
Washington (5), we have assumed that the 
measurement of choice-behavior in social 
groups might provide one useful intermedi- 
ate criterion of interpersonal relations. 

One seemingly useful measure of choice- 
behavior in social groups is provided by 
sociometric methods, wherein members of a 
group are asked to designate other members 
of the group with respect to specified criteria 
of choice, e.g., “whom would you like to sit 
by?” “who are your best friends?” “whom do 
you visit?” etc. A review by Loomis and 


Pepinsky (4) indicates that sociometric meth- 
ods, while potentially useful, need further 
refinement in terms of definition and meas- 
urement. P. N. Pepinsky has warned against 


the uncritical application of traditional 
psychometric notions of reliability and valid- 
ity, urging “The systematic development of 
new concepts in a new frame of reference .. . 
a development in which rigorous statistical 
method is matched by rigorous theoretical 
definition” (6, p. 48). 


Tue ProsteM 


An interesting problem that arises in the 
construction and use of sociometric devices 
is how to describe the behavior sampled by 
the choosing activity of individuals in a 
group. One should be able to define such be- 
havior precisely. Similarly, one should be 
able to define precisely the behavior of others 
in the group toward an individual member 

1This study, the second in a series on the criterion 
in counseling, was made possible through a grant 
from the Committee on Research of The State College 
of Washington. It was reported at the APA mectings, 


Chicago, 1951. 
2 Now at The Ohio State University. 


in terms of the choices he receives from them. 
Sociometric techniques in current use suffer 
from the fact that dimensions of choice- 
behavior sampled by individual items have 
not been specified. As a consequence, results 
obtained by use of such instruments are diffi- 
cult to interpret. 

Confronted with this methodological prob- 
lem, we set ourselves the task of constructing 
a sociometric test, the items of which could 
be scaled at relatively equal intervals along 
a single dimension. Requirements in select- 
ing the dimension were that it could be (a) 
operationally defined and (4) described in 
terms of degree along a measurable con- 
tinuum. 


METHOD 


Step 1 involved selecting a dimension for study: 
“Effectiveness of pazticipation in group activity.” 

Step 2 was to attempt operational definition of the 
dimension. This is contained in the following ex- 
cerpt from instructions to judges who were to assign 
scale values to our sociometric test items: 


Tue Group Participation ScALE 

You are helping us to construct a group partici- 
pation scale. Your judgments will be used to 
establish scale values for the items rather than 
to evaluate your performance on this scale. (Note: 
This is not a test; your name is not called for!) 
The technique we are using is essentially that 
followed by Thurstone in the construction of an 
attitude scale. The complete tabulation of your 
judgments will be returned to you for class dis- 
cussion and consideration. 

1. Outside your family, what is the most im- 
portant group to which you now belong or have 
belonged to in the past? (Space for answer) 

2. What goals do you think are most important 
for that group? (What is the group trying to do? 
Be specific!) (Space for answer) 

3. Let’s assume that within this group: 
AN EFFECTIVELY PARTICIPATING 
GROUP MEMBER /NITIATES, DEFINES, 
SUSTAINS, AND DIRECTS ACTIVITY 
LEADING TO THOSE GOALS WHICH 
YOU THINK MOST IMPORTANT FOR 
THE GROUP. 
INITIATES: 

(a) Gets group activity started. 

1. Says or does something initially that 

others respond to. 
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Example: “I 
started. 
today?” 


at's get this meeting 


What shall we do 


DEFINES: 
(a) Proposes group activities. 
Examples: “Let's talk about our finances.” 
“Let's plan a party!” 
(b) Makes proposals for group activities more 
specific. 


1. Clearly separates one group activity from 


another. 

Example: “Couldn't we consider these 
proposals one at a time? On 
the one hand we have a pro- 
posal for a party. On the 
other hand we have a pro- 
posal for a campus clean-up 
day.” 

SUSTAINS 
(a) Causes group activity to continue at a 
high level. 
1. Keeps group from “bogging down.” 
“I know we are tired, but 
we must finish this job.” 
“Now we are getting some- 
where. Let’s keep going!” 


Examples: 


DIRECTS 
(a) Centers group effort on one specific pro- 
posal or task. 

Example: “Let’s postpone those other de- 
cisions and concentrate on the 
election of officers.” 

Theoretically, the most effectively participating 
all of these things in the 
The least effectively participating 
group member does none of these things in any 
degree. Most people will tend to fall somewhere 
in between these two extremes. 

It was assumed that the above definition would 
conform to the conception of “effective participation” 
which might be expected of the naive, adolescen: 
Moreover, it was assumed that 


' 
group member does 


greatest de gree. 


college student. 
eliciting responses to the two questions preceding 
the definition would help to reduce differences in 
“set” among the judges. 

Step 3 was aimed at item selection and scaling 
for the operationally defined continuum, such that 
an equal number of items could be placed at each 
of nine scale positions. We first compiled a list of 
195 “guess who” items describing a variety of roles 
which one might play in adolescent group activity. 
Most of these were obtained from listings provided 
by Hemphill (2), Deutsch * (1) and Lippitt (3). 
Two judges independently checked the 195 items, 
retaining only those (a) cut across 
three out of the four definition statements (i.e., 
“initiates, defines, sustains,-and directs”) or (6) 


items which 


8 We are indebted to Dr. Deutsch for supplying us 
with unpublished materials used in his research. 


which cut across none of the definition statements.* 
A third judge acted as arbitrator for the disputed 
items. This procedure reduced our list to 170 
items. 

After the scaling procedure of Thurstone and 
Chave (7), the entire group of 170 items was sub- 
mitted to a college social psychology class of 35 
students (juniors, seniors and graduates) for the 
purpose of deriving a median scale value and 
measure of interjudge variability for each item. 
These judges were asked to place each item on a 
graphic scale ranging from Position 1, “least effec- 
tive,” to Position 9g, “most effective,” in terms of 
the definition statements of effective participation 
(part of the instructions to judges is provided under 
Step 2). Items were arranged in random order, and 
the positions of the graphic scales (i.e., 1-9 or 9-1) 
under the items were randomized. A previous dry 
run with another group of 12 students had helped 
to eliminate “bugs” in the written instructions and 
other aspects of administering the scale. 

We then computed medians and semi-interquartile 
ranges from the ratings of the 170 items in the 
hope of securing five items at each of the 9g scale 
positions with O values less than 1.00. Actually, this 
criterion could be satisfied only for Positions 1-8, 
resulting in a revised list of 40 items as shown in 
Table 1; the Q values for Items 12, 13 and 31 
are also slightly greater than 1.00. ‘ 

A group of 20 “experts” in the field of social 
psychology throughout the country, persons who, 
in the writers’ opinion, had distinguished them- 
selves in their research and writings, provided com- 
parison ratings of the 4o items.° They were in- 
structed to use the original nine-point scale, while 
restricting their actual ratings to scale positions 1-8. 
They were not told to distribute their ratings in 
any way (e.g., a judge might have limited all of 
his ratings to scale positions 1-2 or 6-7). The 
comparison data are also shown in Table 1. We 
could thus obtain a final list of 24 items, three at 
each of the eight scale positions, with all but one 
item having QO values below 1.00. The scale values 
assigned for subsequent analysis were those of the 
student judges, since the scale was to be used 
subsequently with student groups. 

Step 5 consisted of randomizing the 24-item scale 
and administering it as a sociometric test to four 
fraternity groups at the State College of Washington. 
Subjects were allowed to use only one item to desig- 
nate each group member. We took care to have 
an indoctrination session with a dozen interested 
house presidents, including a written statement 
about our project which they could take back to 
their houses. After an unhappy experience with 
one group, which gave us a coldly hostile reception 
despite its earlier stated willingness to cooperate, 
we hit upon the expedient of inviting ourselves 


*For example, “Who is the best-dressed member 
of the group?” Actually, Criterion (4) was used only 
as a check on the relevance of our definition statements. 
We hypothesized that 12 such items would have large 
Q values. They all did. 

5We are grateful for returns from 90 per cent of 
the group whose help we had solicited! 
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TABLE 1 
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* 24 items used in sociometric scale. 


to dinner before administering the scale! We also 
emphasized that individuals could discuss their 
results in private conference with professional mem- 
bers of the Student Counseling Center staff. We 
were henceforth warmly received as helpful guests 
by the succeeding four groups whom we tested. 

Tabulation and analysis of the sociometric test 
data for each of the four fraternity groups com- 
pleted the research procedure. 


RESULTS 


In our opinion, Table 1 shows a remark- 
able agreement in the ratings of the student 
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and “expert” judges. The final group of 24 
“guess who” items is indicated in the table by 
asterisks at the right of the item numbers. 

Administration of The Group Participation 
Scale as a sociometric test was designed to 
answer three questions based on analysis of 
scaled scores derived from choices received 
on the test: (2) To what extent does the 
use of our scale as a sociometric test yield the 
possible range of effectiveness in group par- 
ticipation as defined operationally? (6) Are 
the derived scaled scores normally distributed 
about the midpoint of the scale* (c) Can 
members of a group agree in their choices 
of any given individual as indicated by lack 
of variability in the scaled scores received by 
him? 

Answers to these questions may be found 
in Table 2, which summarizes the medians 
and semi-interquartile ranges of scaled scores 
received by individual members in the four 
fraternity groups. The following results were 
obtained: 

1. As operationally defined, the possible 
range of scale values from “least effective” to 
“most effective” participation was 1-9. Scale 
values provided by the judges’ ratings ranged 
from 1.08 to 7.88 (Table 1). The range of 
choices used by the four fraternity groups 
was from 1.94 to 7.88, almost the entire range. 
Hence use of the scale as a sociometric test 
yields a large part of the possible range of 
scores. 

In Fraternity D, which has the smallest 
spread, a range of but slightly less than five 
out of the eight possible scale positions is 
obtained. 

2. Differences between the mean,zg and 
median,,q scores for each fraternity, shown 
at the bottom of Table 2, indicate a slight 
negative skewness in three of the four groups. 
The distribution of scores for Fraternity D 
appears to have a slight positive skew. The 
semi-interquartile range of scores for each 
group is less than 1.00, indicating leptokurtic 
distributions. In each fraternity the mid- 
point of obtained scores approximates 5.5, 
which is higher than the theoretically ex- 
pected scale value of 4.5. Thus, we cannot 
conclude that our derived scaled scores are 
normally distributed about the theoretically 
expected midpoint, but experience suggests 
that The Group Participation Scale is more 
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satisfactory in this respect than most other 
sociometric tests. 

3. No generalization is apparent with re- 
spect to the agreement within each group on 
choices assigned to any given individual, al- 
though 54 per cent of our subjects were 
assigned scale values with semi-interquartile 


Haroip B. Pepinsky, Laurence Srecer, anp Extis L. VanArra 


vide a useful intermediate criterion. Although 
sociometric methods may be used to measure 
choice-behavior, current sociometric tests suf- 
fer from the fact that dimensions of choice- 
behavior sampled by individual items have 
not been specified. 


We have attempted to construct a sociomet- 


TABLE 2 


MEDIANS AND SeMI-INTERQUARTILE RANGES oF ScALE VALUES AssIGNED To Eacu Susyect By MEMBERS OF 
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ranges of less than 1.00. This relatively high 
consistency in derived scaled scores received 
by individuals suggests the value of further 
research to determine the significance of ab- 
normally large O values. 


SUMMARY 
If the central task of psychological counsel- 
ing is to help individuals with their problems 
of interpersonal relations, the measurement 
of choice-behavior in social groups might pro- 
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ric test, the items of which are scaled at 
equal intervals along the operationally de- 
fined dimension of effectiveness in group 
participation. Two sets of judges, using 
Thurstone scaling methods, helped us to ob- 
tain a test composed of 24 “guess who” items, 
with three items at each of eight scale posi- 
tions ranging from “least effective” to “most 
effective” participation. The Group Partici- 
pation Scale used as a sociometric test in 
four fraternity groups yielded (a) almost the 
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entire range of scores as defined operationally 

by our scale, (6) a more normalized distribu- 

tion of scores than seems to be provided by 
most sociometric tests, and (¢) a relatively 
high agreement, among fraternities sampled, 
of group members in choosing given indi- 
viduals. 
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THE EFFECT OF THE PERCEPTION OF MILD DEGREES OF 
THREAT ON PERFORMANCE 


ARTHUR W. COMBS anp CHARLES TAYLOR 


School of Education, Syracuse University 


nyec and Combs (8) in their outline of 

phenomenological theory, have stated 

that the effect of the individual’s per- 
ception of threat to self is to reduce his per- 
ceptive field to the area of the perceived 
threat. This is by no means a new concept 
in the psychology of perception and has often 
been described as “tunnel vision.” Indeed, 
it is not uncommon to hear the phenomenon 
implied in the descriptions of the “man in 
the street” as he recounts his experience under 
frightening circumstances. Such observations 
as “all I could see was that huge truck coming 
at me” or, “I was so scared I didn’t hear 
what he said,” are commonplace with all 
of us. 

Recently in the psychological literature 
this phenomenon has been given much at- 
tention as an important factor in maladjust- 
ment. Rogers (6), for example, has stated 
the phenomenon as follows: “Any experience 
which is inconsistent with the organization 
or structure of self may be perceived as 
threat and the more of these perceptions there 
are, the more rigidly the self-structure is or- 
ganized to maintain itself” (6, p. 364). This 
effect of the perception of threat on the indi- 
vidual’s adjustive behavior has also been 
studied by Beier (2) and Cowen (5). All 
of these studies suggest the restriction of 
the perceptive field under threat. 

According to Snygg and Combs (8), the 
rationale of the maladjustive state is roughly 
as follows: The perceptive field of the threat- 
ened individual becomes restricted to the 
area of the threat he perceives. Hence, un- 
able to select his perceptions from a wider 
field, he behaves compulsively, or repeatedly 
behaves in a nonadjustive fashion. With his 
perceptive field narrowed to the field of the 
threatening object or event, he is unable to 
select from his perceptive field more adequate 
behavior. Apparently, the greater the degree 
of threat to self perceived by the individual, 
the more pronounced is the restricting effect 
upon the individual's perceptive field. 


Both Rogers (6, 7) and Combs (3) have 
pointed out that it is this effect of threat that 
nondirective therapy seeks, at least in part, 
to counteract in its emphasis upon a permis- 
sive nonthreatening atmosphere. Presumably, 
in the degree to which the client can be 
freed from threat, the more readily and 
widely he will be able to explore his percep- 
tive field and the more likely he is to discover 
new and more adequate perceptions of self 
and his relationship to the world in which he 
lives. 

The Problem of this Study 

Though the phenomenon seems well es- 
tablished in the case of traumatic events, it 
is not so clear whether this effect exists in 
the case of milder threats to self organization 
as perceived by the individual. This experi- 
ment was designed and carried out in the 
spring of 1949, to investigate whether or not 
the restricting effect of threat could be 
demonstrated in milder, social threats to the 
individual, where the threat perceived by the 
subject was not of an extreme character or 
could be made less extreme through the sub- 
ject’s ability to minimize or eliminate his 
perception of threat. 

In designing our experiment we stated our 
problem as follows: Presuming that the 
effect of threat in the perceptive field is to 
restrict perception to the field of threat, then, 
in a given simple task, we would predict 
that the introduction of mild threat to the 
self during performance would result in an 
increase in the time required to complete 
the task and the number of errors made in the 
performance of the task. 


The Production of a Mild Degree of Threat 

Since the perception of threat is an internal 
and personal experience of the subject, the 
degree of threat perceived is, of course, not 
open to direct external measurement. We 
were confronted, therefore, with the problem 
of how to assure that the threat experience of 
the subjects in our experiment would be a 
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mild rather than a stronger one. Theoreti- 
cally, the degree of threat experienced by the 
subject would appear to depend upon: 

1. The value in the individual's peculiar 
economy of the particular aspect of self which 
he perceives to be threatened. An individual 
who perceives himself as “a very brilliant 
fellow” would presumably be much more 
threatened by a slight to his intelligence 
than would another person who was less 
concerned about intelligence level. Thus, if 
one could find a way of threatening an indi- 
vidual in such a manner as to make the threat 
quite separate from the task he was attempt- 
ing to accomplish or in such a way as to 
avoid having the subject perceive the threat 
as directly and inescapably directed at self, 
one might succeed in getting only a mild 
degree of threat experience. 

2. The physical or psychological immediacy 
of the threat perceived. It would appear 
that a threatening perception which must 
be dealt with at once is more threatening 
than one which could be postponed or avoided 
in some fashion. The threat of tomorrow's 
examination may be greatly reduced if the 
subject is able to postpone or avoid it. It 
seemed to us, therefore, that if we could de- 
vise a means of making the threatening per- 
ception secondary to the task to be performed 
and left entirely within the subject’s control 
to perceive as immediate or remote, we should 
in some measure assure a mild, rather than 
a traumatic degree of threat experience. 

3. The feeling of adequacy to deal with 
the threatening perception possessed by the 
subject. Presumably, threatening perceptions 
in which the subject feels adequate to deal 
with the threat experienced are less likely to 
be strongly threatening than those in which 
the subject feels inadequate to deal with the 
threat he perceives. We attempted, therefore, 
to give our subjects a feeling of complete 
adequacy in handling the experimental task 
and introduced our threat situations in a 
manner as to make it possible for our sub- 
jects to feel quite adequate to handle them 
effectively by any number of devices which 
he might select. 

With these criteria in mind we devised our 
experiment to create for our subjects the mild- 
est possible degree of threat experience. All our 
subjects were volunteers “interested in taking 
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part in an experiment.” Since an experiment 
is a more or less cold blooded, objective sort 
of situation, our subjects were comparatively 
free to perceive the threats to which they were 
subjected as having little to do with them- 
selves as individuals. 

We presented our threat situations in such 
a manner that they appeared to have nothing 
to do with the task to be performed. So 
far as our subjects were concerned, the prob- 
lem with which they were working was learn- 
ing to translate sentences into code. Any 
threat which they may have felt in the 
course of the experiment from our threat 
stimuli was not necessarily related to the task 
to be performed or immediate to personality 
structure. 

Finally, the task we chose for our subjects 
was a very simple one which they had ample 
opportunity to learn to the point where they 
themselves expressed satisfaction that they 
knew it perfectly They could feel entirely 
adequate to deal with the task. Since the 
threats presented to the subject were quite 
secondary to the task and within his own 
control at all times, he could also feel com- 
pletely adequate to deal with the threatening 
perceptions he made during the course of the 
experiment. He could, for instance, relegate 
them to the “not self” by perceiving them as 
“having nothing to do with me,” as being 
“absurd,” as “referring to someone else,” etc. 
or, if he felt any personal reference of the 
threats he perceived, he could always sup- 
press the perception and avoid dealing with 
it. 

While it is impossible to assess accurately 
the degree of threat felt by our subjects in 
measurable terms, we nevertheless feel quite 
certain that we achieved only the mildest 
form of threat for our subjects. Observations 
during the experiment indicate that most of 
our subjects were quite relaxed and appeared 
to be enjoying themselves. At no time were 
there any external indications of severe shock, 
discomfort, or undue tension in their be- 
havior. 


The Task 


The task we selected for our experiment 
was a simple code, familiar to many school 
children and similar to that used by Ter- 
man (9, p. 330) in the “average adult” scale 
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of the 1916 Stanford Revision of the Binet- 
Simon Intelligence Scale. 

Subjects were instructed to memorize the 
code and were given a sample sentence prop- 
erly coded as well as a series of practice sen- 
tences to translate. When the code had been 
thoroughly learned to the satisfaction of 
the subject, he was assured that results would 
be known only to himself and the examiner 
and the experiment proper was begun. 


The Threat and Nonthreat Stimuli 

Two series of sentences were devised: a 
threat series and a neutral series. Both series 
of sentences were thirty letters in length, con- 
tained as many letters from the first half of 
the alphabet as the last half and contained 
approximately the same number of vowels. 
All sentences were printed in capital letters 
to reduce possible location cues. In trans- 
lating these sentences to code, each letter 
had to be placed in a continuous series of 
thirty boxes on a sheet of paper. This was 
done to minimize the effect of differing word 
lengths and to prevent any easy repetition 
of later letters in a series from the spotting of 
earlier ones. 

Seventeen neutral sentences were devised, 
expressing some common observation which 
did not appear to contain much personal 
reference in the best judgment of the ex- 
aminers. The first six of these were used as 
practice items and were always given first. 
They were not used in tabulating our re- 
sults. The neutral sentences used for the 
experiment proper were: 

He came to buy the flowerpots for her. 

No teacher tends to leave very early. 

The campus grew quite drab in winter. 

His girl wore flowers on her tan suit. 

My scarf is printed in greys and reds. 

He told what he wanted from our store. 

Now a large house would be just right. 

Some bushes were planted by the boys. 

Most gardens are more damp in spring. 

It was a spot of paint on the black tie. 

Daniel and LeRoy want a windup train. 


Ten threat-provoking sentences were pre- 
pared which in the judgment of the experi- 
menters seemed likely to be repugnant or 
embarrassing ideas to our subjects. The list 
follows: 


1. I always perform very badly on codes. 
2. Something is wrong with my sex organ. 
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. My family do not respect my judgment. 
. The silly worms crawled over my neck. 
. It is stupid to believe there is a God. 

. I said that my mother is a dirty whore. 
. My hand shakes whenever I am nervous. 
. Now on my date I belched quite loudly. 
. His brain was crushed to a mushy pulp. 
. Anything sexual always disgusts me. 


The Experimental Procedure 


Twenty-seven men and 23 women students, 
volunteers from advanced undergraduate sections 
in psychology, served as subjects for our experi- 
ment. As soon as the code had been learned to 
the satisfaction of the subject, the code card was 
removed from sight and the experiment was begun. 
Both sets of sentences, each sentence on a 3 by 
5 inch card, were thoroughly shuffled and reas- 
sembled as follows: first, the six practice cards were 
given the subject to translate although in varying 
order. Following this, neutral and threat cards 
were given in alternating sequence. 

Rest periods were allowed whenever the sub- 
ject desired them. Accurate time by stop watch 
was kept and recorded on a score sheet which also 
had space for error scores, after the coding had 
been checked for accuracy and after the subject 
had left the room. Four examiners were used, all 
well briefed so that accuracy of method and simi- 
larity of approach were assured. There were no 
significant differences in results obtained by dif- 
ferent examiners. 


A Validity Check on Our Criteria 

In our experiment proper we attempted to 
assure the mildness of threat by making it 
possible for the subject to “not see” the 
threat if he so desired or to perceive it as 
being remote from himself. The perception 
or nonperception of threat was left entirely 
in his control. As a matter of curiosity on 
the part of the examiners “to see what would 
happen” and as a check on the validity of our 
assumptions concerning the criteria govern- 
ing the degree of threat experienced, two 
instances of other types of threat were intro- 
duced in our design. 

We reasoned that, if, in the course of the 
experiment, the examiners were personally 
to threaten the subject mildly such threats 
would be immediate and outside the subject’s 
control. If our criteria were accurate such 
threats should be stronger than other threats 
in the experiment and should produce more 
errors and require a greater time for com- 
pletion of the task. Accordingly at the pleas- 
ure of the examiner, and just preceding the 
administration of any nonthreat sentence, the 
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examiner threatened the subject by stating 
mildly: 

a. “You aren’t concentrating, I’m afraid. You 
can do better than this.” (Capability threat) 

b. “Can’t you do it a little bit faster?” (Time 
threat) 


Experimental Results 


Average time and error scores were calcu- 
lated for the threat and neutral sequences of 
sentences for each of our subjects. Examiner 
threats given at a time of a neutral presenta- 
tion were scored as threat sentences. In 
every instance but one, our subjects required 
more time to complete the threat sequence 
than the nonthreat sequence. Similarly, with 
a single exception, our subjects made more 
errors under threat than in response to the 
neutral items. 

The critical ratios of the differences between 
our threat and nonthreat populations appear 
in Table 1. 


TABLE 1 


Means, STANDARD DEVIATIONS AND CriTicaL Ratios 
oF DirFERENCES FOR TIME AND Error StTupies 
UNDER THREAT AND NoNTHREAT CONDIT‘ONS 


Errors 





Time IN SECONDS 





Non- 
THREAT 


Non- 


THREAT THREAT THREAT 





87.60 1.82 1.17 
10.53 43 7 
1.50 A -107 
4.58 


Mean 94.92 
SD 10.74 
SE, means 1.53 
CR 3.41 





It would appear that our experimental pre- 
diction of increased time and error under 
mild threat is amply borne out by these 
results. 

The five sentences producing greatest time 
and errors arranged in decreasing magnitude 
were as follows: 


t. A nonthreat sentence preceded by examiner’s 
“time threat.” 

2. My family do not respect my judgment. 

3. Now on my date I belched quite loudly. 

4. A nonthreat sentence preceded by examiner’s 
“capability threat.” 

5. It is stupid to believe there is a God. 


Interestingly enough, sentence number 6 
of our threat group which we had expected 
would prove to be most threatening of all our 
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sentences, proved actually to be least threaten- 
ing. Questioning of our subjects suggests 
that this sentence, far from being threatening, 
was regarded bv our subjects as so absurd as 
to be amusing. Many of our subjects when 
confronted with this card chuckled aloud. 
This seems to us an interesting demonstra- 
tion of the perceptive and personal character 
of threat. Threat is what appears to one as 
threatening, irrespective of what outside ob- 
servers think of the situation. 

The relatively high positions of our two 
external threat sentences in the above sequence 
seems to bear out our prediction of greater 
strength of such type of threats and lends a 
measure of support to the criteria of threat 
on which our sentences were constructed. 
It would appear that we were quite success- 
ful in our construction of mildly threatening 
items. 


Conclusion 


Our prediction of the effect of mild threat 
on performance seems amply demonstrated 
by the results of this experiment. As All- 
port (1) has pointed out, however, accurate 
prediction does not necessarily validate the 
basis frem which the prediction is made, 
but it certainly lends support to the supposi- 
tion. 

Whether the particular theoretical orienta- 
tion from which the prediction was made 
proves accurate or not, there seems little 
doubt that it is possible to demonstrate that 
even a fairly mild perception of threat can 
seriously affect the adequacy of an _ indi- 
vidual’s behavior. This seems to us a prin- 
ciple worthy of far more attention than it 
has received in the psychological literature. 
If even a mild perception of threat can 
markedly affect the adequacy of behavior, the 
implications of such a principle for psycho- 
therapy, education, social action, or diplomacy 
are too far-reaching to be neglected as a 
psychological problem. 


SUMMARY 

The prediction was made that the intro- 
duction of a mild degree of personal threat 
in the course of the solution of a simple 
task would result in an increase of time re- 
quired to complete the task and of errors 
in performance. Fifty college students were 
given the task of translating sentences into 
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a simple code. Mildly threatening sentences 
were interspersed among neutral sentences. 
With a single exception, threat sentences re- 
quired longer to complete and produced 
greater errors in translation. Experimental 
results appear to amply demonstrate the pre- 
diction. 
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HE concept of class as an aid in the ex- 
“T  planation of social behavior has become 

increasingly important in the field of 
psychology. The early development of this 
concept by Max Weber (18) was, from an 
economic frame of reference, similar to the 
Marxian concept of the economic interest 
group. Contemporary sociologists and social 
psychologists have not only recognized in 
varying degrees the economic component of 
class, but have also to great advantage in- 
cluded the status concept in their differentia- 
tion of the American culture. 

Various methods have been used to dif- 
ferentiate our culture into classes, yet the 
characteristics of the groups differentiated 
by these apparently different methods are 
somewhat similar. For example, Warner (17) 
and his students employed a system of classi- 
fication which is dependent upon a number 
of status characteristics, including occupation, 
source of income, house type, and dwelling 
area. In addition, he employs informant 
criteria and pooled information data for de- 
fining class, largely in terms of prestige level. 
Davis (4) used records of participation in 
socioeconomic groups and expressions of 
willingness to associate with others in social 
relationships as the criteria for classifying 
his subjects into classes. Centers (3) has de- 
veloped a system of class differentiation that 
depends on the consciously recognized group 
with which individuals identify themselves. 
In addition, he employed occupational strata 
in the analysis of his data. Hollingshead (9) 
employed ratings by different informants as 
the criteria for classifying the subjects in his 
study. 

Thus, the middle class, as determined by 
Warner's system of status characteristics, has 
many factors in common with the middle 
class as determined by Davis’ common par- 
ticipation technique. Both of these groups 
have many characteristics in common with 
the middle class as determined by Centers’ 


*A PhD. thesis performed under the direction of 
Franklin J. Shaw at Purdue University, June 1951. 


system of class differentiation. To the ex- 
tent that these groups are similar, we may 
synthesize ideal learning environments for 
the greater part of the American culture and 
formulate hypotheses as to the behavior ex- 
pected in terms of the results of these studies 
of class structure. 

In general the studies of class-related be- 
havior are what might be termed segmental 
studies, with the interpretations likewise seg- 
mental. From the results of several of these 
studies briefly presented here, the author will 
attempt in this paper to develop and explore 
a cultural configuration approach to class- 
related behavior. This emphasis upon cul- 
tural configurations is in part a reaction 
against the segmental interpretations pre- 
sented by many writers in the culture-per- 
sonality field. Their approach has been the 
result, mainly, of choosing a particular psy- 
chological frame of reference, generally neo- 
Freudian, or none at all; abstracting very 
minute areas of the total cultural configura- 
tion; and interpreting the minute phenomena 
involved as the causal factors in creating the 
mature personality of the culture or sub- 
culture. For example, La Barre (12) virtually 
ascribes the main features of Japanese person- 
ality to the rigid bowel training of infants. 
Kluckhohn and Mowrer (11), on the other 
hand, have made an attempt at a more con- 
figurative approach in that they postulate 
that too precipitous training of the child in 
weaning, cleanliness, sex taboos, and aggres- 
sion-control lays the groundwork for “obses- 
sive ambition” and “severe competitive be- 
havior” in adults of cultures employing such 
a configuration of socialization techniques. 
This approach, which tends toward the con- 
figurative approach to class, has not often 
been employed by culture-personality people. 

Ericson (5) reports that there were vast 
differences in feeding techniques, toilet and 
cleanliness training, and age of assumption 
of responsibility in the middle and lower 
classes in the American culture. Specifically 
the pattern of infant care in the lower class 
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was more permissive than that of the middle 


class. Lower-class mothers breast-fed their 
children more often than middle-class mothers 
and weaned them later. Lower-class infants 
were on a self-demand schedule of feeding 
more frequently. Toilet training started at 
a significantly later date in the lower class, 
and yet it was completed at the same time as 
in the middle class. The middle class en- 
forced early demands on the child, while in 
the lower class this training was not instituted 
until the child could learn with relative ease. 

Davis (4) found that the expression of 
emotion and aggression was more direct in 
the lower class than in the middle class. The 
threat of love withdrawal was also employed 
as a major socialization technique by the 
middle class but not by the lower class. Kin- 
sey et al. (10) found that sexual expression is 
permitted earlier and that a heterosexual out- 
let occurs more frequently in the lower class. 
Green (8) has developed the thesis that the 
neurosis of the middle-class male results 
mainly from socialization techniques, which 
provoke anxiety, and from emphasis on attain- 
ing higher status. Gorer (7) and Mead (14) 
have, in their anthropological analysis of the 
American culture, implicitly suggested that 
the middle class is the most neurotic of the 
three broad classes. 

Centers (2) found that the middle class 
followed a deferred gratification pattern with 
emphasis on future goals. Hollingshead (9) 
noted that members of the lower classes did 
not tend to forego immediate pleasures in 
the interest of attaining long-range goals. 
Davis (4) also noted that the members of 
low income groups (lower class) sought im- 
mediate gratification whenever possible. Also 
in support of this finding, Goldschmidt (6) 
has noted that members of the lower class 
frequently preferred expensive consumer 
goods to getting a better home. Such be- 
havior was frequently found by Centers (2) 
to be related to the individual’s estimate of 
his own position in the power structure. 
Bakke (1) has found that where little eco- 
nomic security existed in a particular class, 
planning for the future was thwarted. 

From the empirical and theoretical obser- 
vations of these men, the author will now 
attempt to formulate general cultural con- 
figurations of the middle and lower classes 
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in the American culture. From consideration 
of the work of Ericson, Davis, and Kinsey, we 
may agree with the theorizing of Green, Gorer, 
and Mead, that the middle-class cultural con- 
figuration contains many anxiety-provoking 
stimuli. On the other hand, while the lower 
class may offer little economic security, as 
Bakke has pointed out, the socializing factors 
in this culture are much less anxiety-arousing. 
As Centers and Hollingshead have shown, 
other socializing agencies in the middle class 
teach that immediate pleasures should be 
foregone in the interest of gaining long-range 
goals. 

From this class configuration, although 
recognized as incomplete, we may make cer- 
tain generalizations about middle-class and 
lower-class personalities. The middle-class 
personality may be thought of as highly 
anxious and maintaining expectations of play- 
ing non-pleasure roles. The lower-class per- 
sonality may be thought of as relatively free 
from anxiety and holding expectations of 
playing immediately pleasurable roles. How- 
ever, in the author’s experience in connection 
with the overt behavior of the adolescents of 
both the middle and lower classes, he has 
found a much smaller difference in the role- 
playing of the two classes than might be 
predicted from the above configurations of 
the classes. 

Until recently, the concept of role has been 
employed almost entirely as a descriptive 
tool by sociologists, psychologist, and anthro- 
pologists in the analysis of social organization 
and disorganization. Linton’s (13) work is 
an example of this general kind of usage. In 
this present study, as in an earlier study (15) 
by this author, the concept of role has been 
used as an empirical tool to determine the 
amount of role-conflict in individuals. 

In consideration of the discussion above, 
the major hypothesis for this study was de- 
veloped in terms of the following set of 
postulates: 

Postulate I. Role expectations which are 
more difficult to realize are less likely to 
be realized. 

Postulate II. Non-pleasure role expecta- 
tions are more difficult.to realize than role 
expectations involving the attainment of 
pleasure. 

Postulate III. Role expectations involving 
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the attainment of pleasure are more per- 
missible in the lower class than in the middle 
class. 

From the three foregoing postulates, the fol- 
lowing hypothesis was formulated: 

Members of the middle class will experi- 
ence more role-conflicts than members of the 
lower class. 

This hypothesis must be supplemented with 
the assumption that, when a lower-class indi- 
vidual does choose non-pleasure role expecta- 
tions, he does so on the basis of immediate 
and pressing considerations. Since he will 
act in accordance with these immediate and 
pressing considerations, he will have no role- 
conflict. 

In addition to this major hypothesis, a 
minor hypothesis was developed: 

An individual who is highly anxious as 
a result of middle-class socialization with its 
great emphasis on high status would tend to 
evaluate himself as happier than lower-class in- 
dividuals, while a lower-class individual would 
tend to evaluate himself as less happy on the 
basis of lack of real property or equally as 
happy because of greater freedom of expres- 
sion. 


Three related questions of lesser importance 
to be explored in this study were as follows: 
1. What is the relation between an occu- 
pational classification of manipulator of things 
and Warner’s lower-class classification, and 
a classification of manipulator of persons 


and Warner’s middle-class classification? * 

2. What is the relation between the class 
placement of subjects by the director of a 
recreation center and by Warner’s classifica- 
tion? 

3. What is the relation between the length 
of time a boy’s family has occupied a given 
class level and the number of role-conflicts 
he reports? 

With these hypotheses and questions in 
mind, this study was set up to determine the 
extent to which conflict between role expecta- 
tion and roles played, as well as self evalua- 
tion of happiness, are related to an individual’s 
class level. 

1 These classifications have been employed by Eliza- 
beth M. Douvan, with impressive results, in the study, 
“The Influence of Social Class Membership on Re- 


actions to Failure,” an unpublished doctoral dissertation, 
University of Michigan, 1951. 
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PROCEDURE 


In order to test the hypotheses and questions, it 
was decided to study the role-conflicts among male 
adolescents through the interview method. This 
group was selected because it was easily accessi- 
ble and certain desired controls, such as age and 
class, were inherent in the group. In order to 
reach youth at different class levels, plans were 
made to select boys from two recreational centers 
located in different types of neighborhoods. 

One of the recreational centers is located in the 
largest slum area of the city and is attended ex- 
clusively by boys of the lower class. The other 
center, which is in an average housing area, is 
frequented gencrally by members of the middle 
class. The city, located in an east north central 
state, can be characterized as relatively small with 
a stable native white population and with no 
ethnic groupings. Its population in 1940 was about 
12,000 with 1 per cent foreign born and 4 per 
cent Negro and with less than 2 per cent transciency 
in a one-year period. It is the county seat, serves 
as the marketing center for the surrounding 
agricultural area, and has a small men’s college 
located there. 

Before interviewing any boys, the author estab- 
lished himself as an accepted part of each recreation 
center by conforming in dress and participating in 
the activities. Acceptance was especially necessary 
at the lower-class center since anyone connected 
with education was treated with antagonism. 


Schedule 


The schedule developed for use in the interview 
consists of a series of 30 paired questions. The 
first set lists questions on role expectations in a 
boy’s daily life while the second set has a corres- 
ponding question on the role played by S wita 
respect to each role expectation. The questions 
were put in semi-random order so that those re- 
ferring to roles in similar situations were not ad- 
jacent. This approach was used on the assumption 
that it would reduce any predisposition to answer 
related questions with reference to the mental 
set sometimes created by such grouping. 

The content of the questions relates to various 
aspects of behavior--sex, education, honesty, per- 
sonal habits, etc. The questions were so phrased 
as to be answered by yes or no. For example, the 
first role expectation question is, “Should a boy 
be home by a certain time at night?”, while the 
corresponding role-played question is, “Are you 
home by a certain time at night?” 

In constructing the schedule, the author originally 
prepared a list of some 50 items suggesting various 


2For the complete schedule of questions used in 
this study, order Document 3426 from American 
Documentation Institute, 1719 N Street, N.W., Wash- 
ington 6, D. C., remitting $1.00 for microfilm or 
$1.00 for photocopies (6 x 8 in.) readable without optical 
aid. 

8 The author has not attempted to differentiate theo- 
retically among role playing, role behavior, behavior in 
a position, or the expectations of these three forms of 
behavior. 
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aspects of adolescent role expectations, based on a 
review of contemporary literature on adolescent 
behavior. These items were then evaluated in 
terms of offering pleasure and non-pleasure role 
expectations or role-playing alternatives. This re- 
view eliminated a number of items not considered 
feasible. The remaining items, put in question 
form, were pretested on five Ss at each recreational 
center, revised, again pretested on five more Ss at 
each center, and finally revised. In this process 


ambiguous questions were removed and questions 
were reworded for greater clarity and for com- 
parable understanding by boys of each class level. 


Interview 

To obtain Ss for interviewing, the author en- 
listed the cooperation of a member of the recrea- 
tion staff to interest boys at the center in partici- 
pating in the study. Arrangements were made for 
the use of a private quiet room at the center, usually 
the recreation director’s office. On evenings when 
the author was at the center, the staff member 
brought each S individually to the office. 

The interview with each S followed a general 
pattern and lasted about a half-hour. All answers 
were recorded by short pencil on a mimeographed 
form. The interviewer first established rapport 
with S by offering a cigarette and talking about 
some subject of interest to S. During this period S 
was asked certain identifying informatioa, namely: 
name, age, school grade, home address, father’s 
occupation, amount of income, number of years 
in one occupational level, and occupation of the 
paternal grandfather and number of years in one 
occupational level. 

The interviewer then paraphrased the following 
statement to S: 


I am making a study of why people do things. 
I am going to ask you sor * questions. I have 
heard all the answers e so don’t be afraid 
of shocking me with r answers. I don’t 
give a darn what they are as long as they are 
the truth. As you have probably noticed, your 
name does not appear on this tabulation sheet, 
and no one will ever know what your answers 
are. In fact, after interviewing one or two more 
boys, I won't be able to remember your answers 
myself! Please answer each question, insofar 
as possible, with a yes or no. When the words 
“a boy” are used in a question, they mean a boy 
about your age and like you in most ways. 


The S was next asked to answer the first 30 ques- 
tions about the role expectations of a boy in his 
daily life. After a few minutes’ break for relax- 
ation, the second set of 30 questions abuut the roles 
played by S were asked. After another period of 
relaxation, the interview was concluded by the 
following questions: 

1. Can you think of any other times when you 
should do one thing and you do another? 

2. How do you think your everyday happiness 


*The Purdue Opinion Poll and many other sources 
were valuable in securing suggestions for the items. 
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compares to the happiness of the boys that go to 
the other recreation center? 

After each S left, the director of the recreation 
center estimated his class level. The following day 
the interviewer drove by S’s home and rated the 
house and dwelling area by inspection. Later he 
rated the occupation and source of income of 
each S's father. 


Sample 

The data presented in this paper are based on 
a sample of 100 white adolescent boys, 50 from the 
lower-class center and 50 from the middle-class 
center. Both groups were similar in age and educa- 
tion. The boys ranged in age from 14 to 18 years 
with an average of 15.9 years and a median of 16.0 
years. Their schooling ranged from the eighth 
through twelfth grade, including some no longer 
in school. Nine boys who participated were found 
to be in indeterminate classes and were accordingly 
excluded from the analysis. 

Through observing the community and interview- 
ing civic recreation leaders, the author determined 
that 'there were roughly five general class: levels 
similar to those described by Warner (17). The 
class I group (the highest prestige group) did not 
take part in the recreation programs of the city, 
but the rest of the adolescent boys did. In classify- 
ing the sample into class levels, the author em- 
ployed the Warner system because it is more sug- 
gestive of the nature of learning environments 
than other systems and because it could be readily 
adapted for use in the community selected. The - 
author found that the Jonesville classifications em- 
ployed by Warner (16) were accurate, except in 
the intermediate areas, through classifying 20 Ss 
by Warner’s I.S.C. scale and then determining 
through various channels of information the actual 
class levels of the Ss. The Ss falling in the inter- 
mediate areas between classes II and III or between 
IV and V were classified according to the informa- 
tion gathered on S's participation in a specific class. 
The nine Ss falling in the intermediate areas 
between classes III and IV were excluded from the 
sample. 

The four ratings for each S (house, dwelling 
area, father’s occupation, and source of income) 
were multiplied by the weighted score, and the 
products were totaled and the class level determined 
according to Warner's (17) class assignment of 
weighted totals. Warner's class assignment was 
determined following the interview so as to keep 
a running tabulation of the number of Ss needed 
in each group to reach the goal of 25 per group. 
The evaluation of occupation in terms of manipu- 
lator of things vs. people was also made at this 
time, and an assignment of class II or III (middle 
class) was assigned to the manipulator of people and 
an assignment of class IV or V (lower class) was 
assigned to the manipulator of things. The con- 
flict score was also determined at this time. A 
conflict which arises from an unrealized role ex- 
pectation was designated by an x in the column 
created for this purpose. These x’s were totaled 
and considered the conflict score. 
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Correlations were then run on class level as 
related to: conflict score, director’s evaluation, and 
manipulation classification. Role-conflicts missed 
were surveyed and listed, and happiness evalua- 
tions were tabulated. 


Discussion AND REsuULTs 

As previously stated, the criterion employed 
as an indicator of conflict was the disagree- 
ment between role expectation for a boy and 
the role played by S on each of the 30 pairs 
of scheduled questions. The results are pre- 
sented and discussed in terms of the hypoth- 
eses and questions raised. 

Examination of the interview records for 
pretest groups one and two indicated that 
conflict resulting from unrealized role ex- 
pectations clearly existed in the adolescent 
personality. A low correlation between the 
number of role-conflicts and class level ap- 
peared in the first pretest group. This same 
relationship, much larger, was also indicated 
for the second pretest group. 

Certain of the data confirm the three 
postulates stated. In particular, the findings 
show that there is a positive relationship 
between the number of non-pleasure expecta- 
tions and the prestige level of the class. An 
analysis of variance shows that the variance 


of the non-pleasure role expectations re- 
ported by Ss of each class level is significant 


at the .o5 level of confidence. The correla- 
tion between S’s non-pleasure expectations 
and his class level is +.75. The differences 
between the mean number of non-pleasure 
expectations of classes II and III, 27.1 and 
28.2 respectively, and between classes TV and 
V, 16.8 and 13.0 respectively, were not sig- 


nificant (Table 1). The difference between 


TABLE 1 
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anp Rote-Conrticts aT Eacn Crass Lever 
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* Significantly different from mean of next higher 
class level. 


the mean number of non-pleasure expecta- 
tions for classes III and IV, 28.2 and 16.8 re- 
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spectively, is significant at the .o5 level of 
confidence. With a ¢-value of 2.45, this dif- 
ference approaches significance at the .or level 
of confidence. These results appear to sup- 
port the third postulate of the major hy- 
pothesis. That is, there is a significantly 
larger number of non-pleasure expectations 
held at the middle class (classes II and III) 
than at the lower class (classes IV and V). 

The average non-pleasure choice per ques- 
tion for the middle-class sample (92 per cent) 
was considerably higher than that for the 
lower-class sample (49 per cent). The middle 
class unanimously chose the non-pleasure 
alternative on one third of the questions em- 
ployed in the interview. On 73 per cent of 
the questions, the middle-class boys chose 
the non-pleasure alternative at least go per 
cent of the time. Some of the relatively low 
questions in terms of non-pleasure choice 
are: 76 per cent of the middle-class boys re- 
ported the expectation of not gambling. In 
64 per cent of the cases, the lowest incidence 
of non-pleasure expectations, the middle-class 
boy felt he should not smoke even if he 
wished to do so. The middle-class boys re- 
ported that they should allow their parents to 
influence their choice of friends in only 68 
per cent of the cases. In 72 per cent of the 
cases, the middle-class boys felt they should 
accept a “bawling out” from their parents 
without talking back. 

On the other hand, there were only two 
notably low non-pleasure choices for the 
lower-class group. Only 10 per cent of the 
lower-class sample reported the expectation of 
allowing their parents to influence their 
choice of friends, while only 14 per cent of 
the lower-class boys reported they should use 
their free time to get ahead instead of enjoy- 
ing themselves. The lower-class boys fre- 
quently indicated through remarks after mak- 
ing a non-pleasure choice, that it was being 
made in terms of immediate and pressing 
cqnsiderations. An example of such a con- 
sideration was when a lower-class boy reported 
that he should be home at a certain time 
at night. He then added, “I gotta be in 
at a certain time, because cops patrol and 
take us in in this neighborhood if we are 
out after curfew time.” Another example is 
when a lower-class boy reported that he felt 
that he should settle differences by talking 
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instead of with his fists. He added, “I’m so 
small, I always get beat up when I fight.” 

The major hypothesis posed is supported 
by evidence from this study that a positive 
relationship exists between the conflict score 
and class level. An analysis of variance shows 
that the variance in the number of conflicts 
reported by Ss at each class level is significant 
at the .o1 level of confidence. The relationship 
between Ss’ conflict scores and their class level 
is +.78. The differences between the mean 
conflict scores of classes II and III, 12.4 and 
11.8 respectively, and between classes IV and 
V, 5.2 and 2.8 respectively, are not significant 
(Table 1). The difference between the mean 
conflict scores of classes III and IV, 11.8 and 
5.2 respectively, is significant at the .05 level 
of confidence. With a tvalue of 2.28 this 
difference approaches significance at the .o1 
level of confidence. 

Because of the lack of significance between 
the mean conflict scores of classes II vs. III 
and classes IV vs. V, a correlation was run 
between conflict scores and class level, the 
class levels being middle class (II and III 
combined) and lower class (IV and V com- 
Although this r 


bined). The r was +.76. 


is not significantly different from the r of 
+.78, the difference may be due to the 


character of the distribution. This finding 
may suggest a slight trend of more conflict 
at each higher class level although the trend 
is not significant. Consideration of these 
results suggests that the major hypothesis 
has been supported. Comparison of the two 
class levels shows that the average conflict 
score for the middle-class sample is 12.1 or 
48 per cent of the total number of conflicts 
possible, while the average conflict score for 
the lower-class sample is 4.0 or 16 per cent 
of the total number of conflicts possible. 
Four exceptionally high areas of conflict 
reported by the middle class (classes II and III 
combined) relate to obedience to parents, 
consideration of parents’ feelings, cheating, 
and use of free time. Specifically, go per 
cent of the middle-class boys report conflict 
over the fact that they should always obey 
their parents instead of doing what they 
wish to do, while 72 per cent reported con- 
flict over the fact that they do not consider 
the way their parents might feel before doing 
certain things. Conflict occurred for 70 per 
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cent of these boys because of cheating when 
they have the expectation of not cheating. 
Conflict over not using their free time to get 
ahead when they hold the expectation that 
they should was reported by 78 per cent 
of this group. 

The conflict scores on all items are low for 
the lower-class group (classes IV and V 
combined) relative to the middle-class group 
(classes II and III combined). The highest con- 
flict scores in the lower-class group relate to 
conflicts over cheating when they feel they 
should not (26 per cent); conflicts of not con- 
sidering the way their parents might feel before 
doing certain things (24 per cent); conflicts 
over the fact that they feel they should always 
try to settle their differences with other boys 
by talking instead of with their fists and yet 
they do fight (24 per cent); and conflict over 
not being good community members (24 
per cent). In the conflict about settling dif- 
ferences, it is interesting to note that most of 
the 24 per cent are upper lower-class boys 
while only a few of the lower lower-ciass boys 
report such conflict. 

As part of the interview, Ss were asked if 
they had any other role-conflicts. They were 
usually unable to suggest role-conflicts not 
already covered. They found it difficult to 
grasp the concept of role expectation and 
role played even when presented in relatively 
simple terms. Their answers were mostly 
reiterations of questions included in the 
schedule. There were, however, two conflicts 
clearly indicated by a number of middle-class 
subjects. They were: (a) They expected 
themselves to enter high school sports but 
did not. (4) They expected themselves to 
drive carefully but did not. A large number 
of lower-class boys reported they should con- 
tribute more to the financial aid of the family 
than they were doing. The frequency with 
which these areas of conflict were suggested 
would indicate that in further studies of this 
type, questions concerning these role expec- 
tations and roles played should be included. 

The minor hypothesis, that the middle- 
class boys would tend to rate themselves 
happier than the lower class, proved invalid. 
The differences between the average self- 
happiness ratings for the four classes were 
not significant. In addition to a low correla- 
tion of +.30, there was no significant dif- 
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ference between the mean happiness ratings 
of the various classes. 

In answer to the first question raised, the 
data showed that there is a high relationship 
between the occupational classification of ma- 
nipulator of things and Warner’s lower-class 
classification and the classification of manip- 
ulator of persons and Warner’s middle-class 
classification. The coefficient of reliability 
was +.78. This finding suggests that if a 
relatively accurate estimate of an individual’s 
class level is desired and a more accurate 
method cannot be used because of time or 
other limitations, the criterion of manipulator 
of things or manipulator of persons may be 
employed since they are classifications sug- 
gestive of factors in the individual’s learning 
environment effecting personality develop- 
ment. 

In connection with the question about the 
relationship between the class placement of 
Ss by the recreational director and by War- 
ner’s class classification, the data indicate 
this relationship to be high. The coefficient 
of reliability was +.84. This high correla- 


tion might be due to the fact that the director 
is in a situation in which he is required to 
cooperate and work closely with people of 


all class levels; thus, he must clearly per- 
ceive the social structure to achieve his ends. 

An analysis of conflicts in relation to length 
of time in class level was abandoned since 
the four classes exhibited a high degree of 
stability (Table 2). However, an analysis of 


TABLE 2 
OccupPaTIONAL Lever or FATHER AND GRANDFATHER 
as Retatep to Crass Lever 





°% oF ° oF 


AveracGeE Fartners PATERNAL 
NuMBER IN GRAND- 
oF YEARS PRESENT FATHERS 
FATHER IN Occu. IN SAME 
PresENt Group Occu. 
Occvu. MORE THAN GROUP AS 
Group 15 YEARS FATHER ® 


°% AND 
DireEcTION 
or ToTaL 
Mositity ® 





Total 
average 76 16 up 
73 28 up 
Il 9 68 24 up 
IV ( 88 12 down 


) 
Vv 6 75 24 down 





* These data were computed from the reported occu- 
pational group of 81 paternal grandfathers; 19 Ss did 
their grandfather’s occupation had 


not know what 


been. 
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total mobility for all Ss who were able to 
report the paternal grandfather’s occupation 
was made from these data even though such 
analysis was not originally intended as part 
of the study. 

In the present sample all mobility originated 
in the lower-middle and upper-lower classes, 
classes III and IV. The total mobility of 
the sample is 16 per cent upward (Table 2). 
Classes II and III contained the largest 
number of mobile families. All these families 
reported upward mobility of one class level. 
Classes IV and V reported less mobility on 
the whole and all mobility was in a down- 
ward direction. No family in class V moved 
up while no family in class II moved down. 
This suggests that the upper-lower class is 
decreasing in size relatively faster than the 
lower-middle class. But in either case, the 
data from the sample suggest that the class 
differentiation seems to be increasing for the 
greater part of the total population.° 


SUMMARY 


The number of role-conflicts, as indicated 
through answers to a schedule of questions 
during interviews with adolescent males, was 
compared with the class level, as operationally 
defined by Warner’s classification system, in 
order to test the major hypothesis that there 
will be a greater number of role-conflicts 
with middle-class individuals than lower-class 
individuals. The study indicates a signifi- 
cantly higher number of conflicts in the 
middle class compared with the lower class. 
The relationship between role-conflicts and 
the four class levels appears to be relatively 
high (r=+.78). In addition, the middle 
class had a significantly higher number of 
non-pleasure expectations. The relation be- 
tween non-pleasure expectations and prestige 
class was high (r==+-.75). 

A minor hypothesis tested in this study was 
that middle-class boys will tend to evaluate 
themselves as equally happy or more happy 
than lower-class boys, while lower-class boys 
will tend to evaluate themselves as less happy 
than middle-class boys, or equally as happy 


5 This finding appears to support Karl Marx’s hy- 
pothesis of “Social Evolution.” Due to the extremely 
small sample, the author feels that this inference is 
merely suggestive and that further investigation is de- 
sirable before any conclusions can be drawn as to its 
validity. 
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as middle-class boys. The findings suggest 
that self-happiness ratings by an adolescent 
do not relate to class level. 

Several lesser questions were also consid- 
ered. A high correlation (r—=-+.78) was 
found between the occupational classification 
of manipulator of things and Warner’s lower- 
class classification, and the classification of 
manipulator of persons and Warner’s middle- 
class classification. Likewise a high correla- 
tion (r==-+-.84) was found between classifica- 
tion of S by the director of the recreation 
center and the Warner classification. 

The stability of occupation in each class 
level made it impossible to relate role-con- 
flicts to length of time in class level. Some of 
the findings suggest that class differentiation 
may be increasing. Further studies might 
well include some additional role-conflicts 
brought to light through this study but not 
explored here. 

Caution must be exercised in generalizing 
from the data derived from the sample used 
in this study to the population at large. 
Specifically, it would be necessary to establish 
that the class stratification of other com- 
munities is comparable to the stratification of 
the community from which the sample em- 
ployed in this study was drawn before gen- 
eralization of the findings to other communi- 
ties could be made. 
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MEMORY AND PSYCHOLOGICAL SECURITY * 
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Unwwersity of Nebraska 


HE problem of memory dynamics is one 
Twit has long been attended to but not 

yet resolved in many of its aspects. Ad- 
ler’s (1) theorizing on the dynamics of recall, 
based upon clinical data, stressed the adaptive 
functions of memory. He clearly describes 
the selective character of memories, which, he 
indicates, are recalled in accordance with a 
current attitude or psychic tendency. Freud 
(10) reaches a similar conclusion regarding 
the selective organization of memory via his 
repression theory, the primary difference be- 
ing that Freud was more interested in for- 
getting than in remembering. The theory of 
repression stimulated a great deal of experi- 
mentation on memory and forgetting. Most 
of these studies, as reviewed by Meltzer (14), 
showed that the majority of subjects tended to 
forget the unpleasant. However, in Rapa- 
port’s opinion, all of these investigations suf- 
fered from a lack of demonstrated relevancy 


to the theory being tested (16, pp. 76-77). 
A major question left unanswered concerned 
the influence of individual differences upon 


recall. Adler’s views and Meltzer’s restate- 
ment of the issue emphasize that pleasant and 
unpleasant memories must be associated with 
other significant individual factors. 

On the basis of his extensive experimental 
study, Bartlett (4) formulated a theory which 
is in essential agreement with Adler’s con- 
ception of memory as an instrumental func- 
tion serving certain individual ends. Almost 
identical hypotheses were developed by Bar- 
rett (3) in 1938 and Edwards (g) in 1941 
using carefully obtained research data. The 
problem is stated by each of these authors 
in terms of a “frame of mind” or “frame of 
reference” hypothesis. Both statements imply 
that those experiences which harmonize best 
with a current frame of reference will be 
best remembered. These researches, plus a 
host of studies stemming from the Lewin- 
Zeigarnik experiment (19), tend to confirm 

1The author wishes to express his appreciation to 


Drs. M. R. Jones and I. Simos for a reading and criti- 
cism of this paper and to W. G. Sawrey for statistical aid. 
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Adler’s original theory of the selective dy- 
namic nature of recall. 

However, as Waldfogel writes, “More work 
should be done on the dynamics of memory 
optimism and memory pessimism and on per- 
sonality differences between memory opti- 
mists and memory pessimists” (18, p. 31). 
Working out the details of the relationship 
between the memory function and personality 
factors is of theoretical interest and of pos- 
sible diagnostic value. 

Of special interest to the present study is 
the literature dealing with childhood mem- 
ories. Ansbacher (2, p. 203) writes a brief but 
clear statement of the essential differences 
in viewpoint between Freud and Adler. “(a) 
Adler sees no particular difficulty in their in- 
terpretation (referring to childhood mem- 
ories) whereas Freud does.” Here Ansbacher 
is drawing upon Freud’s notions concerning 
the screen-like character of early childhood 
memories. “(b) Adler considers the earliest 
incident one can recall as particularly illumi- 
nating, which Freud never pointed out; (c) 
Freud stresses infantile amnesia, which Adler 
never mentions.” 

An initial attempt to investigate the sig- 
nificance of the earliest childhood recollection 
was made by Ansbacher (2). He collected 
earliest memories from each of 271 male col- 
lege students and related certain types of 
memories to scores on the Maslow Security- 
Insecurity Test (12). He found tentative cor- 
roboration of the view that the earliest recol- 
lection is particularly expressive of an indi- 
vidual’s basic attitude toward life. However, 
his conciusion is questionable since he pre- 
sents neither significance statistics nor ade- 
quate comparisons to back up impressions 
obtained at a fairly crude quantitative level. 

Crook and Harden (6) and Child (s) 
tested the hypothesis that repression of child- 
hood memories is associated with neurotic 
tendencies by correlating Pressey X-O Test 
scores with the age of earliest recall and with 
the total number of memories. Child’s study, 
which was the better controlled, found almost 
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no correlation between the purported meas- 
ure of infantile amnesia and degree of neu- 
roticism. No attempt was made in either 
of these studies to determine the emotions 
associated with these memories. Dudycha 
and Dudycha (8), the only investigators to 
include a differential emotional analysis in 
their study, nevertheless did not relate these 
results to any personality variables. Wald- 
fogel (18), in one aspect of his study, sought 
to measure the relationship between degree 
of memory optimism-pessimism and several 
different personality variables, among them 
emotional stability and ascendancy-submis- 
sion. Although his results were essentially 
negative, he felt that inadequacies of his 
measuring instruments may have obscured 
any existing relationships. 

From the foregoing summary it can be 
seen that there are marked deficiencies in our 
knowledge concerning, first, the details of 
the relationship between memory attributes 
and personality variables, and second, the 
psychological significance of early childhood 
recollections with special reference to the 
theory of infantile amnesia. The present 
study is intended to help bridge some of 
these gaps. 

Specifically, the aims were as follows: 

1. To determine whether certain charac- 
teristics of memory, primarily affective in 
nature, are related to security feelings as 
measured by the Maslow Security-Insecurity 
Test. The recall involved in this aspect of 
the study included childhood, pre-adolescent, 
adolescent, and adult memories. Feelings of 
guilt and fear are significant aspects of the 
syndrome of psychological insecurity as de- 
fined and measured by the Security-Insecurity 
Test. Therefore, the expectation is that the 
proportion of guilt or fear memories in the 
total number of memories will be signif- 
icantly higher for the insecure group than 
for the secure group. Results of this type 
would support Adler and the modern “frame 
of reference” theorists. 

2. Following Freud’s conception, which re- 
gards the typical childhood recollection as a 
screen memory, one would not expect to find 
significant relationships between emotional 
features of early memories and _ personality 
variables by the obvious checklist technique 
employed in this study. On this basis, it is 
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expected that a cornparison of childhood 
memories with adult memories will reveal 
a greater number of significant relationships 
between adult memory characteristics and 
security feelings than between childhood 
memory characteristics and security feelings. 
Should the number of significant relation- 
ships be approximately the same for both 
childhood and adult memory characteristics 
it will be apparent that the screen memory is 
not as typical a phenomenon as Freud be- 
lieved. Similarly, the notion of an almost 
complete and universal infantile amnesia re- 
sulting from repression will be seriously 
challenged. 

3. According to Adler’s theories one’s ear- 
liest recollection is especially relevant in the 
investigation of one’s fundamental attitudes 
toward life. It is expected, therefore, that a 
comparison of the earliest memory with one 
selected at random, in this study the tenth 
of twenty memories, will reveal a greater 
number of significant relationships between 
earliest memory characteristics and security 
feelings than between tenth memory char- 
acteristics and security feelings. 


PROCEDURE 


The subjects were 126 college students, 
70 males and 56 females, registered in intro- 
ductory courses in psychology. The instruc- 
tiens given them were as follows: 


You are to write down first your earliest child- 
hood recollection. After that write four additional 
memories which seem outstanding prior to age six 
(approximately), then five outstanding memories 
referring to the age period seven to eleven inclusive, 
then five more covering twelve to sixteen inclusive, 
and finally, five outstanding memories over the last 
three years of your life. In all, then, you will have 
twenty memories. Remember to write down the 
earliest incident that comes to mind. Try to include 
only those memories where you feel certain you can 
actually recall the events. After you have written 
all twenty memories go back to the first one and 
indicate by a check in the proper box which of the 
characteristics listed at the left are appropriate to 
the memory. You may check as many character- 
istics per memory as seem pertinent. Do this for 
the entire group of memories. 


Instructions were explained carefully and 
all questions answered before any memories 
were written out. Subjects were assured of 
anonymity by use of a numerical identifica- 
tion system. 
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The following is the list of characteristics 
which were indicated. on the form used to 
collect memories: 


participating in group activities 
being alone 

pleasant 

unpleasant 

getting or losing prestige 

guilt, inferiority, embarrassment, jealousy 
self sufficiency 

danger or harm to self 

fear 

punishment, frustration 

satisfaction 

activity 

happy, elated, joyful 

worry, depression 

receiving kindness or aid 


Selection of these characteristics was based 
partly upon an attempt to conform to vari- 
ables defined as relevant to the Security test, 
partly upon Ansbacher’s (2) suggestions, and 
partly upon arbitrary choice where it seemed 
reasonab‘e to postulate relationships to secu- 
rity feelings. The list is neither exhaustive 
nor inflexible. 

Subsequent to the rating of the memories 
the initial twenty-five items from the Mas- 
low Security-Insecurity Test were adminis- 
tered. Among the aspects of insecurity the 
test is intended to measure are: feeling of iso- 
lation, rejection, uniqueness; perception of 
other human beings as essentially bad, of the 
world as dangerous; feelings of mistrust, 
threat, shame, guilt; a tendency to pessimism, 
unhappiness. Security is defined as: feel- 
ing of being liked, belonging, safety, friend- 
liness, calm; perception of the world as 
pleasant, of other human beings as essentially 
good; tendency toward optimism, happiness, 
self acceptance. 

This particular test was selected because 
(a) all that was desired at this stage of ex- 
perimentation was demonstration of a group 
tendency, (4) the scoring scheme of the test 
was developed on a college population thus 
making it appropriate to the present sample, 
and (c) the test is brief, yet carefully con- 
structed, with reliability and validity stand- 
ards quite satisfactory for this type of re- 
search. 

Correlation plots were constructed to deter- 
mine the relationship between security scores 
and memory characteristics. The distribu- 
tion of memory characteristics corresponded 
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to the total number of checks per character- 
istic per individual for all twenty memories. 
These plots were found, by inspection, to be 
uniformly rectilinear, and so a Pearson 
product-moment r was calculated for each. 

To answer the question concerning the 
relative significance of the earliest recollec- 
tion, the security scores were first divided in 
two at the mean of the distribution. It was 
expected that memories involving a charac- 
teristic such as fear would be unequally dis- 
tributed—the more insecure group having 
proportionately more fear memories. The 
characteristics studied were only those which 
had been found significantly correlated with 
security scores over the whole range of 
twenty memories. A test of the significance 
of the difference between proportions of each 
characteristic studied was then carried out on 
first and tenth memories only. The tenth 
memory was selected at random as a basis 
for evaluating the relative significance of the 
earliest memory. A similar procedure was 
used in evaluating the significance of affective 
characteristics when the five earliest child- 
hood memories were taken as a group and 
compared with the five adult memories. The 
proportion of fear memories in the secure 
group, for example, was obtained by totalling 
the number of early fear memories and di- 
viding them by the total number of affec- 
tive characteristics attributed to the first five 
memories. 


REsULTs AND Discussion 


Table 1 indicates that eight of the fifteen 
correlations yielded results significantly dif- 
ferent from zero and in the expected direc- 
tion. As was anticipated for this type of 
relatively crude group study, none of the 
single correlations was very high. However, 
the number of significant correlations ob- 
tained clearly demonstrates a_ relationship 
between psychological security and certain 
affective characteristics ascribed to memory. 
Some of the outstanding psychological fea- 
tures of the insecure, e.g., fearfulness, antici- 
pation of punishment or frustration, jealousy, 
inferiority feelings etc. are significantly re- 
vealed by the subjects’ own differential emo- 
tional analysis of their memories. 

An interesting by-product of the results is 
the datum pertinent to the classical problem 
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of memory optimism vs. memory pessimism. 
Rapaport (16), in his summary of relevant 
data to 1942, points out that most investi- 
gators agreed upon the existence of wide 
individual differences in the recollection of 
memories judged pleasant or unpleasant. 
Nevertheless, no attempt had been made to 
relate this feeling tone factor to relatively 
specific personality variables. Waldfogel (18) 
explored several measures of personality 
variables (e.g., Ascendancy-Submission Reac- 
tion Study, Wisconsin Scale of Conservatism- 
Radicalism, and the Thurstone Personality 
Schedule) in seeking a relationship between 
personality factors and memory optimism- 
pessimism with regard to childhood recollec- 
tions. He found only one instance of a 
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scores and unpleasant memories. Thus 
the personality variable, psychological secu- 
rity, is seen to be significantly related to 
judgments of the affective tone of memories. 

While the data are not sufficient for con- 
firming a causal relationship between secu- 
rity feelings and P-U memory tendencies, 
they do support a frame of reference hy- 
pothesis of memory dynamics. The obtained 
relationship is viewed, then, as a function of 
selectivity in what is recalled and how the 
product of recall is interpreted, i.e., the feel- 
ing tone ascribed to it. As Rapaport writes, 
“... the memory of the emotional tone of 
the experience and the present hedonic judg- 
ment of it are so hopelessly intermingled that 
the rating becomes an opinion...” (16, 


TABLE 1 


CoRRELATIONS BETWEEN SecuRITY ScorRES AND MeMoRY VARIABLES FOR THE TOTAL 


oF Twenty MEMorIEs 








VARIABLE 


VARIABLE r 





partic. in group activ. 

being alone 

pleasant 

unpleasant 

getting or losing prestige 

guilt, infer., jealousy, embarrassment 
self-sufficiency 


danger or harm to self .06 


fear 

punishment, frustration 
satisfaction 

activity 

happy, joyful 

worry, depression 
receiving kindness or aid 


—.41** 
.05 


22° 





* Significantly greater than zero at the .05 level of confidence. 
** Significantly greater than zero at the .01 level of confidence. 


significant difference in personality factors 
and that occurred in the case of male scores 
on the Thurstone Personality Schedule. The 
extreme memory pessimists (greater ratio of 
unpleasant to total memories) tended to have 
higher scores on the Thurstone test than the 
extreme memory optimists, indicating a 
higher degree of emotional instability or 
neurotic tendency for the former group. 
The results of the present study offer 
rather clear-cut evidence on this issue. 
Table 2 indicates that, for childhood and 
adult memories alike, secure subjects judge 
significantly more memories to be pleasant 
than do insecure subjects. Conversely, in- 
secure subjects judge significantly more 
memories to be unpleasant than do secure 
subjects. This trend is apparent throughout 
the entire range of twenty memories as 
shown by the positive correlation between 
security scores and pleasant memories and 
the negative correlation between security 


p. 69). Selective recall and interpretation are 
conditioned by personality needs of the 
moment—in this study the needs subsumed 
under the general personality factor security 


feelings. A similar interpretation holds for 
all other dynamic variables, e.g., guilt, frus- 
tration, etc., found to be significantly related 
to security scores. 

It should be noted that the number of sig- 
nificant relationships apparent is minimal for 
the data. The method of analysis used did 
not assume the advantage of comparing only 
the extremes of a distribution. Instead the 
two halves of a distribution were compared 
so that a coriservative estimate of significance 
level resulted. In view of these encouraging 
results a more intensive dynamic analysis of 
individual personality as related to memory 
features—both qualitative and quantitative— 
is certainly in order. Such a study, perhaps 
employing a methodology somewhat like 
Murray’s (15), would be invaluable in chart- 
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ing out more explicitly the relationships be- 
tween the memory function and personality. 

Of incidenial interest is the relative fre- 
quency with which different emotions are 
ascribed to childhood memories. Both Wald- 
fogel (18) and Dudycha (8) find that fear 
and joy occur most frequently. Waldfogel’s 
subjecis, however, judged joy to occur 
approximately twice as frequently as fear 
whereas Dudycha’s findings were almost 
exactly reversed—fear occurring about twice 
as frequently as joy. Again, the issue is 
clarified somewhat by considering the differ- 
ing frames of reference of the subjects. The 
ratio of joy to fear arnong childhood mem- 
ories of secure subjects is about 2.1:1 whereas 
it is only 1:1 for insecure subjects. Similar 
ratios are apparent when adult memories are 
compared—4.4:1 for secure subjects and 1.7:1 
for insecure subjects. In each instance the 
ratio of joy to fear memories among secure 
subjects is twice the ratio among insecure 
subjects. 


TABLE 2 


DIFFERENCES BETWEEN SECURE AND INSECURE GROUPS 
ON SELECTED VARIABLES 


Cup Memories ADULT MEMORIES 
SIGNIFICANCE SIGNIFICANCE 
LeveL LEVEL 


VARIABLE 





. being alone .09* .38° 
. pleasant .002°* -000 
. unpleasant .003° .o7* 
. guilt, inferiority, etc. .05° 02° 
. fear -32° .003 
f. punishment, frust. 02° .009 
. happy, joyful tad 


2°° 


.ooo1*® 07 
. worry, depressed 28° 18° 

a,c,d,e,f and h combined .ooo1* .ooo1*® 
b and g combined .0001°* .ooo1** 





* Indicates variables showing higher proportional fre- 
quency among insecure ; s. 

** Indicates variables showing higher proportional fre- 
quency among secure Ss. 


Perhaps the most interesting results of the 
study are those that deal with the relation of 
security scores to affective characteristics of 
childhood and adult memories considered 


separately. It seems quite conclusively 
demonstrated (see Table 2) that the emo- 
tional impressions ascribed to childhood 
memories are as significantly related to 
security feelings as the emotional impressions 
ascribed to adult memories. For each of 
these age periods six of the eight variables 
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treated yield a critical ratio significant beyond 
the .10 level of confidence—in most cases far 
beyond that. When those variables relevant 
to “security” and “insecurity,” respectively, 
are grouped and treated as single categories 
the significance of the difference between 
proportions greatly exceeds the .ooo1 level of 
confidence. 


Implications for the Theory of Infantile 
Amnesia 


In discussing the nature and dynamics of 
infantile amnesia Freud states: 


I refer to the peculiar amnesia which veils from 
most people (not from all) the first years of their 
childhood, usually the first six or eight years. So 
far, it has not occurred to us that this amnesia 
should surprise us, although we have good reasons 
for it. For we are informed that during those years 
which have left nothing except a few incompre- 
hensible memory fragments, we have vividly re- 
acted to impressions, that we have maniiested 
human pain and pleasure and that we have ex- 
pressed love, jealousy and other passions as they 
affect us... . 

We conclude therefore that we do not deal with 
a real forgetting of infantile impressions but rather 
with an amnesia similar to that observed in neu- 
rotics for later experiences, the nature of which 
consists in their being kept away from conscious- 
ness (repression) (11, pp. 581-582). 


On the basis of his finding that the average 


‘college student is able to recall some fifty 


memories for this early period, and that the 
number of memories increases with age (only 
memories to age eight were analyzed) as 
does the development of mental functions, 
Waldfogel raises a question about the valid- 
ity of the infantile amnesia doctrine. He 
then points out that the core of the theory is 
its insistence that only banal, meaningless 
representations of emotionally significant 
childhood experiences are retained by the 
adult. Freud’s comment on screen memories 
states: 


Memory deals with a mass of impressions received 
in later life by a process of selection, retaining what 
is important and omitting what is not; but with 
the recollections retained from childhood this is 
not so. They do not necessarily reflect important 
experiences in childhood, not even as must have 
seemed important from the child’s standpoint, but 
are often so banal and meaningless in themselves 
that we can only ask ourselves in amazement why 
just this particular detail escaped oblivion. I have 
tried with the help of analysis to attack the problem 
of childhood amnesia and of fragments of recollec- 
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tion which break through it, and have come to the 
conclusion that, whatever may appear to the con- 
trary, the child no less than the adult retains what 
is important; but that what is important is repre- 
sented in the memory by something trivial. For 
this reason I have called these childhood recollec- 
tions screen memories; a thorough analysis can 
evolve from them all that has been forgotten (10, 
p. 178). 


Taken together these two quotations give 
a distinct picture of Freud’s opinions on the 
problem of childhood recollections. What is 
important in the life of the child is said to be 
removed from consciousness by a massive 
repression process. It appears in the guise 
of screen memories via the mechanisms of 
displacement and condensation. Screen mem- 
ories are described as devoid of obvious 
emotional significance and requiring a thor- 
oughgoing analysis for identification of the 
underlying affective factors. The point 
essential to the present argument is Freud's 
implication that, of the childhood recollec- 
tions which are retained, the screen memory 
is the rule rather than the exception. 

Waldfogel’s subjects reported a wide range 
of emotional experience in describing their 
early childhood memories. The preponder- 
ance of affective over non-affective childhood 
memories is disclosed by the present study 
also. More than that, however, the affective 
characteristics of childhood memories are 
shown to be fully as meaningful as those of 
adult memories in relation to current secu- 
rity feelings. In terms of the superficial 
methods employed to discover these affective 
characteristics the results stand in direct 
opposition to the theory that apparently 
meaningless screen memories are the typical 
childhood recollections. 

As in previous studies of early memories a 
question may be raised as to the validity of 
the memories. In order to conclusively dis- 
prove the Freudian theory of infantile am- 
nesia and the typical nature of the screen 
memory one would have to authenticate each 
memory—an obviously difficult procedure. 
Some distortion of childhood, as well as 
adult, experience no doubt occurs. How- 
ever, Dudycha and Dudycha (7) verified 
memories as far back as the third year and 
earlier. This evidence, plus the nature of the 
instructions given the subjects and their cer- 
tainty that they could actually recall these 
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experiences provides the only assurance of 
the authenticity of the memories. 

One might argue that these recollections 
are concealing even more significant material 
and insist that, in that sense, they are screen 
memories. This is a definite possibility, yet, 
if it were so, it would alter the fundamental 
definition of a screen memory as an appar- 
ently meaningless, banal memory fragment. 
For it is obvious from the data that signifi- 
cant relationships do exist between early 
memory characteristics and security scores. 
At the very least, then, the results cast con- 
siderable doubt upon previous notions con- 
cerning the prevalence of screen memories, 
and indirectly, upon the notion of an all-per- 
vasive and universal infantile amnesia. 

One other consideration meriting attention 
is that Freud’s formulations grew out of his 
clinical experience with severe psychoneu- 
rotics for the most part. It may be that 
screen memories and infantile amnesias due 
to massive repressions of infantile sexuality 
are more common phenomena among a 
neurotic group than among a group of sup- 
posedly normal college students. This seems 
a point worthy of investigation. 

General support for the Adlerian view of 
early recollections is offered by the data. As 
Adler suspected, early memories proved no 
more difficult of interpretation than later 
memories. However, neither did they appear 
to possess any special significance in reveal- 
ing an individual’s fundamental security 
attitudes. When the earliest memory was 
compared with a memory selected at ran- 
dom, the tenth, no substantial differences 
were observed (see Table 3). Thus the 
hypothesis that the earliest memory is most 
illuminating was not supported. 


CONCLUSIONS AND SUGGESTIONS FOR FUTURE 
RESEARCH 


1. A relationship was demonstrated be- 
tween the personality factor, security feelings, 
affective characteristics 
Thus an initial ad- 


and a number of 
attributed to memory. 
vance was made toward detailing the rela- 
tionships between personality and memory 
functions. 

2. The issue of memory optimism-pessi- 
mism was clarified somewhat in terms of its 





Memory AND PsycHoLocicaL SECURITY 


TABLE 3 


CoMPARISON OF First AND TENTH MEMORIES FOR 
SecurE-INsEcURE GROUPS 








TenTtTH Memory 
SIGNIFICANCE 
LeveL 


7 

First MEMORY 

SIGNIFICANCE 
LeveL 


VARIABLE 





52° 47° 
. pleasant a -30°° 
. unpleasant 32° 25° 
. guilt, inferiority, etc. freq. too low 23° 
for calculations 
¢e. fear .96* .40° 
f. punishment, frust. 05° 5g® 
g. happy, joyful .38°* 35°° 
h. worry, depression frequency too low for calculations 
a,c,d,e,f and h .06* 04° 
b and g .18** 29% 


. being alone 





* Indicates variables showing higher proportional fre- 
quency among insecure Ss. 

** Indicates variables showing higher proportional fre- 
quency among secure 5s. 


relationship to a personality factor—in this 
study, security feelings. 

3. Affective characteristics attributed to 
early childhood memories were shown to be 
as significantly related to security feelings as 
were the affective characteristics attributed to 
adult memories. Thus the data challenge the 
notion that the superficially meaningless 
screen memory represents the typical child- 
hood recollection. 

4. The implication is that the distortions 
and omissions of childhood memories are no 
more striking or dramatic in character than 
the distortions and omissions of adult mem- 
ories. Serious doubt is cast upon the theory 
of infantile amnesia as a product of a massive 
repression process. Sheer increase in fre- 
quency of early memories with age may well 
be associated with the development of men- 
tal functions, e.g., language, memory, etc. (18, 
p. 15). 

5. The results are consonant with Adler’s 
position that childhood memories present no 
more difficulty in interpretation than adult 
memories. However, his hypothesis concern- 
ing the special significance of the earliest 
memory was not confirmed. 

6. The data are interpreted as being in 
accordance with the modern frame of refer- 
ence hypothesis concerning the memory func- 
tion—frame of reference being interpreted in 
the broadest sense as the personality structure 
and dynamics of the moment. 

Perhaps the most urgent research need, as 
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already suggested, is for studies which aim 
at intensive personality and memory analyses. 
A highly ambitious project, but one which 
might well supply critically important data, 
would be an intensive longitudinal study of 
the kind suggested above. Such a study 
would permit, among other things, an evalu- 
ation of the extent to which childhood mem- 
ories are distorted, and an account of the 
process and frequency with which screen 
memories are produced. 

Another approach could be aimed at col- 
lecting information on the different ways and 
frequencies with which different age groups 
recall childhood experiences. The orthodox 
doctrine of infantile amnesia suggests that 
the frequencies and characteristics of child- 
hood recall would be little influenced by age. 
Schachtel (17), however, in a very provoca- 
tive article, reinterprets infantile amnesia in 
terms of cultural conventionalizations of 
memory schemata whereby much of the indi- 
vidual flavor of experience is lost. This inter- 
pretation suggests the possibility of discover- 
ing differences in childhood recall between 
varying age groups on the basis of develop- 
mental degrees of conventionalization. 


SUMMARY 


The preceding investigation was designed 
to study the significance of characteristics, 
primarily affective in character, which are 
ascribed to memory by college students. Of 
special concern was the relationship of these 
affective characteristics to the psychological 
variable, security feelings. A major aim of 
the study involved the determination of 
whether childhood memories are as psycho- 
logically meaningful as adult memories when 
only a_ superficial analysis technique is 
employed. 

The subjects, 70 males and 56 females, were 
asked to record memories relevant to differ- 
ent periods of their life—five memories each 
for periods ranging from early childhood to 
adulthood. They were then asked to rate each 
of these memories in terms of a list of fifteen 
variables. The first twenty-five items of the 
Maslow Security-Insecurity Test were then 
administered. 

Significant correlations were obtained be- 
tween eight of the fifteen variables and secu- 
rity scores. Affective features of childhood 
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and adult memories alike were shown to be 
significantly related to security scores. It was 
also demonstrated that memory optimism- 
pessimism and the relative frequencies of joy 
vs. fear memories were significantly related 
to security feelings. 

The results were interpreted as challenging 
the notion that the screen memory is the 
typical childhood recollection. Further im- 
plications for the theory of infantile amnesia 
were discussed. Confirmation of a broad 
frame of reference hypothesis of memory 
dynamics was indicated and special note was 
made of the general support fer Adler’s 
views on early memories. Exception was 
taken only to Adler’s opinion concerning the 
fundamental importance of the very earliest 
incident an individual can recall. 

Some limitations of the current study were 
discussed and suggestions made for future 
research. 
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BODILY SIMILARITY EETWEEN SUBJECT AND CENTRAL FIGURE 
IN THE TAT AS AN INFLUENCE ON PROJECTION 


EDITH A. WEISSKOPF ann GEORGE P. DUNLEVY, JR. 
Purdue University 


ost clinicians who use the Thematic 
M Apperception Test and similar pic- 

torial techniques become aware of 
the fact that there is a coasiderable difference 
between the diagnostic value of pictures. 
Some pictures stimulate the subjects to give 
a large amount of pr@jective material, while 
others tend to elicit stories which contain rela- 
tively little material beyond a description of 
the objective stimulus. The choice of pictures 
is likely to be determined by trial-and-error 
rather than by systematic selection, unless 
some general laws can be found regarding 
the characteristics of pictures which affect the 
amount of projective material. Such laws 
could not only be of practical value for the 
development of new pictorial techniques but 
could also add to our theoretical understand- 
ing of the mechanism of projection (8, 12, 
13). 

It is often assumed that the amount of pro- 
jection elicited by pictures can be increased 
by increasing the similarity between the sub- 
jects and the figures shown on the pictures. 
The use of special picture series for groups 
of subjects with distinctive external charac- 
teristics is based on this assumption. Thomp- 
son, for example, was impressed with the 
sparsity of material in the TAT protocols 
of Negro subjects. He hypothesized that this 
sparsity might be caused by the fact that all 
human figures shown in the TAT pictures 
are white. In order to test this hypothesis, 
he developed a modification of the TAT for 
use with Negroes by substituting Negro fig- 
ures for the white figures on all pictures. He 
found, in agreement with his hypothesis, that 
his sample of Negro students told signifi- 
cantly longer stories about ten pictures from 
the Negro TAT than about the correspond- 
ing pictures from the traditional Murray 
TAT. However, a study by Riess, Schwartz, 
and Cottingham (7) does not support 
Thompson’s hypothesis. The authors found 
that the stories of Negro subjects do not dif- 
fer significantly in length when the pictorial 


figures are Negroes than when the pictorial 
figures are white. Nor was any significant 
difference found between the story length for 
white subjects on the Negro and on the white 
set of pictures. According to Korchin, Mitch- 
ell, and Meltzoff (4), even Thompson’s orig- 
inal observation that Negroes produce less 
material on Murray’s TAT than whites is not 
of general validity. It is interesting to note 
that Thompson’s subjects were Southern 
Negroes, while the subjects of the two op- 
posing studies were Northern Negroes. This 
difference suggests that the effect of variations 
in physical similarity between subject and 
pictorial figure might depend on the cultural 
connotations of the physical resemblance. 

Another attempt to increase the amount 
of projection by increasing the similarity 
between story-teller and pictorial figures was 
made by Lasaga and Martinez (5). When 
testing nuns and priests, the authors used a 
special set of TAT pictures in which the 
pictorial figures were nuns and priests. How- 
ever, clinical impression suggested that this 
modification did not increase the diagnostic 
value of the stories. 

An especially drastic attempt to increase 
the similarity between story-teller and pic- 
torial central figure in the TAT was made 
in an unpublished study by Weisskopf and 
Money, who substituted a photograph of the 
subject’s own face for the face of the central 
figure. No significant difference was found 
between the amount of projection elicited by 
pictures showing the subject’s face and pic- 
tures with a face lacking any marked sim- 
ilarity with the subject. 

Even the selection of four special sets of 
pictures for men, women, boys, and girls 
within the TAT itself is based on the assump- 
tion that similarity in age and sex between 
subject and central figure enhances projec- 
tion (6). The general impression of various 
clinicians, and studies such as Coleman’s (2) 
and Weisskopf’s (12) throw some doubt upon 
the validity of this assumption. 
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Not only modifications of the original 
TAT, but also other picture tests show the 
general trend to endow the pictorial figures 
with the same differential features which char- 
acterize the group of subjects for whom the 
test was designed. 

Symonds’ (g) test for adolescents, for ex- 
ample, depicts adolescent heroes. His selec- 
tion of pictures is based on judges’ ratings. 
He states that pictures with figures with 
whom the subject can identify as to age, sex, 
status, and general similarity were rated as 
most valuable for diagnostic purposes (8). 
However, ten years later Symonds qualified 
this statement by saying: 

jt is most natural for an adolescent to identify 
himself with an adolescent. But some adolescents 
find this difficult and must identify themselves with 
an older or younger person. And most adolescents 
find it possible to identify themselves with babies, 
with older men or women, with members of the 
opposite sex, with animals, and even with inanimate 
objects (9, Pp. 52, footnote). 


Symonds also observed that cross-sex identi- 
fication often led to stories that had a deeper 
significance. 

There are numerous other pictorial tests, in 
which the pictorial figures possess the same 
distinctive characteristics as the subjects for 
whom the test was designed. Henry’s (3) 
pictures for Navaho Indians show Navaho 
Indians as pictorial figures. Bachrach (1) de- 
picts crippled children in his pictures for use 
with such children. 

Summarizing, it can be stated that, while 
there is a trend to endow the pictorial cen- 
tral figure with similar characteristics as the 
subject, clinical impression as well as experi- 
mental studies make it appear extremely 
doubtful whether such similarity enhances 
projection. Most experimental evidence does 
not corroborate the assumption that projec- 
tion increases with increased similarity. 

It is the objective of the present study 
to make a further contribution to the test- 
ing of the hypothesis that similarity between 
story-teller and pictorial central figure in- 
creases the amount of projection. The type 
of similarity tested here is bodily similarity. 
The problem under investigation is: do sub- 
jects who are deviant as to their physique 
project more upon pictures showing physically 
deviant central figures than upon pictures 
with normal central figures? 


E. A. WeissKorr anp G. P. Duntevy, Jr. 


METHOD 


Subjects and Materials 


The subjects (Ss) were thirty male undergraduate 
college students. Ten subjects were obese, ten were 
crippled, and ten were of normal weight and without 
physical handicap. The criterion for obesity was 
any deviation of thirty pounds or more above stand- 
ard weight. The criterion for crippledness was im- 
paired locomotion due to any abnormality of the 
lower extremities necessitating the use of crutches 
or other orthopedic aids. The three groups were 
within comparable ranges of intelligence as meas- 
ured by the ACE test. Their Bernreuter scores on 
the BI-N scale were within the range indicating 
mental health. None of the subjects had ever taken 
the TAT prior to the experiment. 

The materials consisted of three sets of pictures. 
Each set included eight pictures. The first set con- 
sisted of reproductions of the following TAT pic- 
tures: 1, 2, 3BM, 4, 6BM, 7BM, 10, 13MF. The 
second set consisted of modifications of these pic- 
tures, whereby the central figure in each picture was 
enlarged to obese proportions. The third set dif- 
fered from the reproduced original TAT pictures 
merely by an added crutch supporting the body of 
the central figure or standing in close proximity. In 
addition to these modifications the following minor 
changes were made in the pictures: in Picture 2 the 
girl in the foreground was replaced by a boy. Since 
all Ss were male, the rationale for this change is 
obvious. Pictures 1 and 10 were modified so that 
the entire body of the figures could be seen. With- 
out this change the figures could not have been 
recognized as having normal, crippled, or obese 
bodies. 

Figure 1 shows Picture 13MF in its three modi- 
fications. 


Procedure 


All 24 pictures were individually 2dministered to 
each S. The experiment was conducted in three ses- 
sions for each S, with an interval of one week 
between sessions. Eight pictures were administered 
in each session. The order of presentation was con- 
trolled in two respects: Not more than one modifica- 
tion of any of the eight pictures was shown in the 
same session, and all three types of central figures 
were presented in each session. The order of pres- 
entation was kept constant for all Ss. 

The instructions differed from the traditional in- 
structions used with the TAT: S was asked to 
describe the pictures rather than to tell a story. This 
change in instruction was introduced for the follow- 
ing reason: The amount of projection elicited by 
each picture was measured by transcendence indices, 
a quantitative measure of projection introduced by 
Weisskopf (12); this index is obtained from de- 
scriptions of pictures rather than from stories about 
pictures. However, it can be assumed that, even 
though the instructions to describe might reduce 
the amount of projection as compared to the tradi- 
tional instructions, this reduction would leave the 
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Fic. 1. THree Moopirications oF Picrure 13MF 


relationship between the amount of projection in 
various responses unchanged. 

The transcendence index of each picture for each 
group of subjects was computed. “Jn order to ob- 
tain the transcendence index of a picture, the num- 
ber of comments about this picture which go 
beyond pure description were counted. The tran- 
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scendence index of the picture is the mean num- 
ber of such comments per subject. Pictures with 
low transcendence indices make it relatively easy 
for the subject to be factual and impersonal. Pic- 
tures with high transcendence indices make such 
impersonal observation difficult and, as it were, 
‘lure’ the Ss away from the objective path pre 
scribed by the instructions and force them to pro- 
ject” (12, p. 379). Thus, transcendence indices may 
be considered a measure of the amount of projec- 
tion elicited by a picture. This measure might be 
preferable in many ways to a word count, which is 
used in some of the studies discussed above (4, 7, 
10). Word counts include objective descriptions of 
the pictures which can hardly be considered projec- 
tions. 

The intra-judge reliability of the scoring system 
used was determined through rescoring of one 
hundred responses one year after the initial scoring. 
The junior author functioned as judge. The reli- 
ability was found to be .95. In an unpublished 
study by Weisskopf and Lynn it was found that the 
inter-judge reliability for three judges ranged from 
86 to .94. In an unpublished study, Beller and 
Haralson obtained an r of .86 for the inter-judge 
reliability for two judges and an r of .95 for the 
test-retest reliability. 


REsuLTs AND Discussion 


Figure 2 shows the mean transcendence 
indices of the three picture sets for the obese, 
normal, and crippled subjects. 


The analysis of variance was used to an- 
alyze the results. For this purpose, the tran- 
scendence counts of the individual stories were 


used as the basic data. The variables under 
examination were type of subject (normal, 
crippled, and obese), type of central figure 
(normal, crippled, and obese), and depicted 
scene. 

A test of interaction between the first two 
variables is the main issue under considera- 
tion. It was found that this interaction is not 
statistically significant. The result suggests 
that physical similarity between the subject 
and the pictorial central figure does not affect 
the amount of projective productivity. These 
findings are in agreement with other previ- 
ously discussed experimental studies (2, 4, 
7, 12). However, these results should be ac- 
cepted with reservations, since the measure of 
projection used in the present investigation is 
purely quantitative and thus does not take 
account of any qualitative differences in 
strength of projection. 

Nevertheless, the combined results of the 
present study and of previous investigations 
make it highly doubtful that anything can 
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be gained by depicting the pictorial central 
figure as like a member of the group for 
which the test is designed. 

The data yielded various additional results 
with no immediate pertinence to the main 
hypothesis. For example, the mean tran- 
scendence indices of the three sets of pictures 
differ from each other beyond the .02 level of 
significance. The use of Tukey’s method (11) 
of testing individual differences between 
means in an analysis of variance confirmed 
the impression gained from an inspection of 
Fig. 2, that the mean transcendence index of 
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figures than with obese ones. Crippledness, 
on the other hand, is considered even a 
greater handicap than obesity. However, in 
contrast to obesity, crippledness has dramatic 
and heroic connotations, which might make 
it more “tempting”: for subjects to establish 
identity with a crippled pictorial figure. Since 
crippledness results in helplessness while 
obesity does not, crippled pictorial figures 
may offer the subjects an opportunity to 
project their dependency needs. Qualitative 
inspection of the records supports this hy- 
pothesis. 
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AND CripPLep SUBJECTS 


pictures with obese central figures is smaller 
than the mean transcendence index of pic- 
tures with normal or crippled central figures. 
These differences are statistically significant 
beyond the .05 level. The difference between 
the pictures with normal and with crippled 
central figures, on the other hand, is not 
statistically significant. 

Thus, pictures with obese central figures 
appear to elicit less projective material than 
pictures with normal or crippled central fig- 
ures. People’s attitudes toward obese, crip- 
pled, and normally-built individuals might 
be responsible for this finding. In our cul- 
ture, physical normality is definitely consid- 
ered a more desirable condition than obesity. 
Thus, we might expect subjects to establish 
identity more willingly with normal pictorial 


A Tukey analysis (11) of the interaction of 
central figure with depicted scene suggests 
that pictures on which a crippled figure is 
shown with parent figures (Pictures 2 and 
6BM in their crippled modifications) tend to 
elicit more projection than the other pictures. 
This might be the case because parental pic- 
tures frequently stimulate the projection of 
dependency needs and crippled pictorial 
heroes are more appropriate recipients for 
dependency projection than normal or obese 
heroes. On the other hand, pictures which 
show crippled central figures and elicit pro- 
jections on topics such as S’s heterosexual 
relationships with contemporaries (Picture 4, 
in its crippled modification) are less stimu- 
lating than other pictures. In connection with 
these topics Ss may become somewhat blocked 
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when trying to identify with seriously handi- 
capped figures. These tentative interpretations 
might warrant additional testing through 
further research. 

Finally, the analysis of variance showed 
that the difference between the mean tran- 
scendence indices for normal, crippled, and 
obese subjects is statistically significant be- 
tween the .10 and .20 levels. There might 
be a slight trend for crippled Ss to project 
less than normal and obese Ss (cf. Fig. 2). 
The use of a larger number of Ss might have 
raised this trend to statistical significance. 
Again, more research is needed to test further 
and interpret this finding. 


SUMMARY 


1. The findings of this study did not sup- 
port the hypothesis that physical similarity 
between the subject and the central pictorial 
figure affects the amount of projection. 

2. Obese central figures did not elicit as 
much projection as normal or crippled fig- 
ures. These differences were significant be- 
yond the .05 level. 

3. Pictures 2 and 6, in their crippled modi- 
fications, elicit more projection than the other 


pictures. Pictures 1 and 4, in their crippled 
modifications, elicit less projection than the 
other pictures. 

4. Crippled subjects show a trend, without 
statistical significance, to be less productive 
than normal and obese subjects. 
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working in the psychoanalytical tradi- 

tion, have been acutely aware of the im- 
portant part which unconscious feeling reac- 
tions play in the dynamics of the therapeutic 
process. Particular stress has been laid on 
the patient's feelings toward the therapist (the 
transference) (8) although the therapist's un- 
conscious feeling reaction toward his patient 
(the countertransference) has been increas- 
ingly emphasized within the last years (1, 3, 
Q, II). 

A recent paper by one of the authors has in- 
dicated that measures of the therapist’s feeling 
reactions toward his patient can be utilized 
for assessing the competence of the therapist. 
These attitudes were inferred from the dis- 
tortions in the therapist’s perception of his pa- 
tient (4). The present study is an extension 
of this investigation. It will attempt to ex- 
plore the interactions of certain patient and 
therapist attitudes. Because of the difficulty 
of obtaining data of this nature the sample 
had to be far smaller than is desirable. 
This paper is thus primarily concerned with 
the methodological aspects of studying the 
interactions of the patient’s and therapist’s 
feelings and to suggest leads for further 
studies. 


PP working inti and especially those 


Hyporueses 


It is the basic assumption underlying the 
approach of this paper that each of the two 
individuals involved in the therapeutic situa- 
tion is unconsciously, and in part also con- 
sciously, aware of the other person’s feelings. 
We assume that inaccuracies in the way in 
which he perceives the other person’s feelings 
stem primarily from unconscious blocks or 
needs on the part of the perceiver. Thus, the 
patient’s need to see his therapist as near per- 
fect will distort his perception so as to make 
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the therapist seem more perfect; the thera- 
pist’s desire to be helpful will distort his per- 
ception in such a way as to make him see the 
patient as needing help. The degree and di- 
rection of the distortion is measurable by com- 
paring the perception with the perceiver’s own 
self-description and ideal-self description on 
one hand and the perceivee’s self-description 
on the other (4). Essentially, the assumption 
underlying this method is the same as that 
which underlies present day projective tests. 
In our case a person rather than an inkblot 
or picture is the relatively unstructured stim- 
ulus. In both cases, the need or attitude is in- 
ferred from the perceptual distortion. It is 
the hypothesis of this investigation that the 
attitudes which one of the two participants 
has toward himself and toward the other per- 
son will be meaningfully related to the atti- 
tudes held by the second person. 

While we have intercorrelated all variables 
which have been used in this investigation, 
certain variables were posited in advance as 
relating to therapeutic competence or the 
goodness of the therapeutic relationship. 
These expectations were based in part on the 
findings of a previous paper (4) and in part 
on theoretical expectations in the case of cer- 
tain other variables and will be so noted in 
the discussion. 


METHOoD 


Fifteen therapists and one of each of the 
various therapist’s patients served as subjects 
for this study. The therapists included 
twelve psychologists, two psychiatric social 
workers, and one psychiatrist. Though we 
attempted to use only psychoneurotic patients, 
one organic patient with personality malad- 
justment problems and one ambulatory 
schizophrenic patient previously diagnosed 
as psychoneurotic were unintentionally in- 
cluded in the patient group. 
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The procedure based on what has come 
to be called Qtechnique (10) was as fol- 
lows: 


Each therapist and each patient were given three 
identical arrays of 76 statements? taken from Mur- 
ray’s Explorations in Personality (7). They were 
asked to follow these instructions: 


Please sort these 76 statements so that they best 
describe you. Sort them into eight categories of 1, 5, 
12, 20, 20, 12, 5, and 1 statements each, so that the one 
statement which describes you best is on one extreme of 
this distribution, then the five next most descriptive 
statements are in the second pile, and so on. The one 
statement which is least characteristic of you will then 
be on the other extreme of your sorting. There should 
be no cards left over. Be sure you place the proper 
number on each pile. 


Using the same instructions, the therapists and 
patients were then asked to “sort these cards as 
you would ideally like to be,” and 


PAaTIENT-T HERAPIST Parrs 


447 


pute fifteen intercorrelations. For the purpeses of 
this study, we utilized a smaller number of variables 
which are described below, and also some other 
measures, such as the reputed competence of the 
therapist and time in treatment. 

In order to explore the interrelations between the 
measures which we derived from the sortings, we 
have ranked the fifteen patients and therapists on 
each one of these measures, and computed a rank- 
order correlation matrix. 

Finally, a qualitative analysis was made of the 
differences between the content of the therapists’ 
and patients’ perceptions of each other. 


Table 1 presents in schematic form the 
intercorrelations of the measures which have 
here been utilized. As will be noted, some 
are merely ranked correlation coefficients 
(e.g., the therapist’s prediction of how the 
patient would describe himself, and the pa- 


TABLE |! 


ScHEMATIC REPRESENTATION OF CORRELATIONS OBTAINED FROM 


Various SorTINGs 








T* Sevr T Ipear 


T Prep 


P® SELF P IDEAL 





self = 

” ideal TSS 
pred TAS 
self RS 
ideal Prsi 
pred Ppred 


UAS=AS?—RS? 
UASI=ASI*—RSI? 


TPRED 





* T=Therapist; P—Patient. 


Sort these cards as you think your patient (therapist) 
described himself. Do not sort them as you think he is, 
but try to duplicate his description of himself. (The 
patient was assured that his therapist would not see how 
the patient sorted for him, and whenever possible, all 
sortings were made on different days.) 


In accordance with Stephenson’s Q-technique 
(10), each of the statements was assigned a score 
ranging from 1 to 8, depending on the category 
into which it has been placed in each particular 
sorting. The statement sorted as most characteristic 
was given a score of 8, the least characteristic state- 
ment a score of 1. The entire distribution of scores, 
being by design normal, could then be intercorre- 
lated with every other sorting by the Pearson 
method of squared differences. 

We obtained six distributions of scores from 
each patient-therapist pair, and we thus could com- 


1 To save space in the Journat, the table of traits has 
been deposited with the American Documentation Insti- 
tute. Order Document 2433 from American Documen- 
tation Institute, 1719 N Street, N.W., Washington 6, 
D. C., remitting $1.00 for microfilm (images 1 in. high 
on standard 35 mm. motion picture film) or $1.00 for 
photocopies (6x 8 in.) readable without optical aid. 


tient’s actual self-description) while others 
are derived measures (UAS, UASI). Since 
we are interested in the distortions of percep- 
tions we must not only ask, for example, how 
similar or dissimilar the therapist thinks the 
patient’s self-description to be to his own but 
also whether he over- or underestimated the 
similarity between himself and the patient. 
The measures used in this study are briefly 
described below, along with the interpreta- 
tions which we have tentatively advanced. 

a. Self satisfaction (SS)? is obtained by correlat- 
ing the sorter’s self-description with the description 
of how he would ideally like to be. While high 
SS might be interpreted as real adjustment, that is, 
someone who is without tensions, who is in the 


2To facilitate interpretation of the symbols those 
denoting variables primarily derived from the therapist's 
sortings have been denoted with a T as the first letter 
and capitalized throughout. Variables derived primarily 
from patient sorts have been denoted by lower case and 
with a P as the first letter. 
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main satisfied with himelf, it is also found in very 
defensive or in smug individuals who cannot bear 
the thought of being imperfect. Whatever the indi- 
vidual case might be, SS is a measure of publicly ad- 
mitted satisfaction with oneself. 

b. Prediction (Pred) is obtained by correlating 
the predictive sort with the other person’s self-sort. 
It is essentially a measure of understanding the 
other person in the diagnostic sense of the word. 
We did not this variable to be related to 
ther variables. 

c. Real Similarity (RS) is obtained by correlating 
the self-sorts of the patient and therapist. “Real” in 
this case indicates, of course, only the real similarity 
between self-descriptions, and only to the extent to 
which the self descriptions reflect personality struc- 
ture can we speak of real similarity between the 
persons involved. This variable was also expected to 
be unrelated to others. 

d. Real Similarity to the Ideal (RS/) is obtained 
by correlating one person’s Ideal with the self-sort 
of the other. Again the limitations of (c) apply 
here. 

e. Unwarranted Assumed Similarity (UAS) is ob- 
tained by subtracting the squared “real similarity” 
from the squared “assumed similarity” (AS is the 
correlation between prediction-sort and self-sort of 
the predictor.)? In other words, UAS gives us an 
estimate of the overestimation of similarity which 
exists between the predictor and the predictee. 
TUAS is used to signify the therapist's unwarranted 
assumed similarity to his patient. Puas represents 
the patient’s unwarranted assumed similarity to his 
therapist. As has been more fully described in a 
previous paper (4), high TUAS may be interpreted 
to mean that the therapist sees the patient as like 
himself, that is, as a person who acts and feels like 
he, the therapist, does. We therefore infer that the 
therapist sees the patient as a person with whom he 
can identify, whom he can understand, like or 
respect; this presumably is indicative of a good 
therapeutic relationship. Negative TUAS is here 
interpreted to mean that the therapist sees the 
patient as somebody very unlike himself, and, we 
may therefore infer, as a person who falls into a 
different class, a person toward whom he wants 
to maintain a distance, thus indicating dislike.* 
Puas, on the other hand, suggests that the patient 
sees the therapist like himself, i.c., someone who 
needs help, w! 9 is himself maladjusted. Thus, high 
Puas probably indicates that the patient distrusts, 
or disrespects the therapist. A low Puas might then 
be compatible with an attitude of respect and liking 
by the patient. 

f. Unwarranted Assumed Similarity to the Ideal 
(UAS]) is obtained by subtracting the r7pg, from 
r?497, the assumed similarity to the Ideal. High 
positive TUASI can be interpreted to mean that the 


expect 


* While this measure has yielded good results it is not 
clear at this time whether the present mathematical treat- 
ment yields the most appropriate and efficient measures. 
Further work on this problem is now underway. 

4 This hypothesis has been tested and confirmed in a 
subsequent investigation which dealt with the relation- 
ship of these attitudes to sociometric choice.® 
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therapist overestimates his patient’s adjustment, that 
he believes the patient to be a stronger person (by 
the therapist’s standards) than the patient really is. 
This was here interpreted to mean that the therapist 
expects and demands attitudes (presumably of emo- 
tional gratification to the therapist) which the 
patient cannot fulfill. On the other hand, perceiv- 
ing the patient to be much less well adjusted, thus 
more in need of help, will probably cause the thera- 
pist to offer succor and help to the patient (See a.) 
Thus relatively lower TUASI/ will presumably occur 
in good therapeutic relationships. Puast, the pa- 
tient’s idealization of the therapist's qualities, seems 
to be a transference phenomenon which was recog- 
nized by Freud as the “overestimation of his [the 
therapist's] qualities” (6). Seeing the therapist as a 
very strong, self-sufficient person will permit the 
patient to make demands for emotional gratification, 
which, after all, is what the patient came to get. 
Low Puasi, which is quite rare, might indicate then 
that the transference is poor, hence Puasi was 
hypothesized to be correlated with therapeutic com- 
petence and good therapeutic relationships. 

g. Therapeutic Competence (Rx) was obtained 
from rankings by supervisors in the installation in 
which the therapists practiced. This is a rather un- 
satisfactory criterion of competence but represents 
the best one available for this study. The therapists 
who participated ranged in experience from physi- 
cians completing their psychoanalytic training, and 
clinical psychologists with extensive psychoanalytic 
or nondirective training to students in psychology 
and social work beginning their graduate work, or 
residents who had just completed a general intern- 
ship. The rank-order correlations which have been 
obtained are presented on Table 2. 


We shall briefly attempt to interpret the 
10 rank-order correlations which appear most 
promising as hypotheses for further study 


(see Table 2). Since these correlations are 
based on a sample of only 15 cases, all need 
to be cross validated in future studies. A 
correlation of .54 is significant at the 5 per 
cent level according to Olds’ tables. Since we 
calculated 84 coefficients, four could be ex- 
pected by chance alone. In addition, some 
of the correlations are based on correlated 
variables and are thus spuriously high. 
Reputed therapeutic competence (Rx) vs. 
prediction (Rx-Pred=.50). Better therapists 
in this sample tend to predict their patients’ 
self sorts more accurately than reputedly 
poorer therapists. This finding is not sup- 
ported by our other studies and may be 
explained in part by the correlation of .3 
between success of prediction and time in 
treatment. This correlation may be due to 
the fact that better therapists are able to hold 
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their patients in treatment for a longer time 
than poor therapists. 

Reputed therapeutic competence vs. self- 
satisfaction (Rx-TSS= —.50). Better thera- 
pists tend to be less self-satisfied than poorer 
therapists. 

Reputed therapeutic competence vs. the 
patient's idealization of the therapist (Rx- 
Puasi=.49). The better the therapist the 
more the patient tends to see him as an 
ideal. This is listed by Freud as one criterion 
of positive transference (6, p. 3). 

Therapist’s accuracy of prediction of the 
patient vs. the therapist's self-satisfaction 
(TPRED-TSS= —.48). The more self-satis- 
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like, or empathize with, his patient. This 
finding is related to the second and fifth 
above (Rx-TSS and TSS-Puas:). Since thera- 
pist’s and patient’s ideals stem from the same 
stereotype, as will be seen later, the therapist 
whose self-description correlates highly with 
the patient’s ideal is likely to be the one 
whose self-satisfaction is high. 

“Real similarity” of the patient to the thera- 
pist’s ideal vs. the patient's unwarranted as- 
sumed similarity (TRSI-Puas=.60). The 
more the patient resembles the therapist’s 
ideal, the more the patient tends to see the 
therapist like himself, that is, poorly adjusted 
and in need of help. The more adjusted the 
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RANK-OrperR CorRELATION MATRIX FOR 12 


VARIABLES IN PATIENT-THERAPIST Pairs 





Rx TPRED Ppred TSS 


Pss 


RS 


TRSI TUAS Puas TUASI Puasi TT** 


Prsi 





<o® 

21 

—5 
03 


—0o6o 


20 

© —48* 
32 

09 

I! 35 
= —24 
39 39 
o1 21 
05 —O05 
49° 38 


32 32 


02 
24 
09 
05 
04 


—26 


=I 
—28 
08 
—15 
—I2 
24 
—s50°* 





* Correlations discussed in this paper. 
** Time in treatment. 


fied therapists tend to be poorer predictors, 
that is, tend to understand their patients 
diagnostically less well. 

Therapist’s self-satisfaction vs. patient's 
idealization of the therapist (TSS-Puasi= 
—.50). The more self-satisfied the therapist, 
the less tendency of the patient to see him 
as an ego-ideal, that is, a person who in this 
case can be expected to provide satisfaction of 
needs to the patient. On the other hand, 
greater self-satisfaction on the part of the 
patient seems to have no relationship to the 
therapist’s expectations from the patient 
(TUASI). 

“Real similarity” of the therapist to the 
patient's ideal vs. therapist's unwarranted as- 
sumed similarity (Prsi-TUAS= —.50). The 
greater the similarity of the therapist to the 
patient’s ideal, the less the therapist tends to 


patient, the more he perceives the therapist 
as being similar to himself. 

Patient's self-satisfaction vs. patient's un- 
warranted assumption of similarity between 
himself and the therapist (Pss-Puas=.77). 
The more self-satisfied the patient the less 
does he feel that his therapist is better ad- 
justed than he. It is also interesting to note 
that the more self-satisfied patients tend to 
see their therapists in a less unwarrantedly 
idealized light (Pss-Puasi=—.42). This 
might be an indication of the resolution of 
transference, or that self-satisfied (smug) 
patients do not form transference relation- 
ships as easily as less self-satisfied patients. 

Patient's self satisfaction vs. “real similar- 
ity” between patient and therapist's ideal (Pss- 
TRSI=.57). The more self-satisfied patients 


resemble the therapist’s ideal more closely. 
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This is probably due to the fact that patients 
and therapists tend to share in a culturally 
determined ideal of adjustment, as will be 
shown below. 

Patient's unwarranted assumption of sim- 
ilarity between the therapist and himself vs. 
idealization of the therapist (Puas-Puasi= 
—.49). The more the therapist resembles, in 
the patient’s eyes, the patient’s ideal the less 
does the patient feel the therapist to be mal- 
adjusted like himself. 

We finally analyzed the rank-order correla- 
tion matrix by Butler’s method of rank pat- 
tern analysis (2).° 

We obtained two patterns (or factors), one 
consisting of (@) therapeutic competence as 
rated by the supervisor (Rx); (6) accuracy 
of the therapist’s prediction of the patient 
(TPRED); (c) the therapist’s identification 
with the patient, or the therapists’ overesti- 
mation of similarity between themselves and 
their patients (TUS); (d) the patient’s 
idealization of the therapist, which is at least 
one symptom of good transference (Puasi); 
(¢) low self-satisfaction on the part of the 
therapist (TSS); (f) dissimilarity of the 
therapist to the patient’s ideal (Prst). With 
the exception of TSS and Prsi these are vari- 
ables which we expected to find in good 
therapeutic relationships. The therapist’s 
identification with the patient has, of course, 
been previously found to be correlated with 
rated therapeutic competence (4). 

The other pattern, apparently indicative of 
poor therapeutic relationships, consisted of 
(a) the patient’s self-satisfaction (Pss); (6) 
the patient’s perception of similarity between 
himself and the therapist (Puas). In effect, 
the patient sees the therapist as being as mal- 
adjusted as the patient perceives himself to 
be; (c) the patient’s similarity to the thera- 
pist’s ideal (TRSI). It is probably rather 
threatening to the therapist when a malad- 
1 patient perceives himself as the thera- 
pist would ideally like to be: and finally (d) 
the t! rapl t’s dissimilarity to the patient’s 


ideal 


juste 


(Prsi). These results fit in quite well 
previous data which we have obtained 
in this and other studies. Perhaps of greatest 
importance may be the suggestion that the 

lf-satisfaction and his similarity to 


“ ith 


pat nt’s s¢ 


5 We are greatly indebted to Dr. Butler for making 
a blind analysis of these data for us. 
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the therapist’s ideal may permit unfavorable 
prognosis of therapeutic relationships. We 
have not previously suspected this finding 
which, if confirmed by later studies, may per- 
mit matching of therapists and patients to 
one another. 

One important question is whether some 
of these correlations might be spuriously 
high since the same elements are contained 
in both variables, e.g. when we correlate 
TSS-Puasi; Prsi-TUAS; TRSI-Puas; Pss- 
Puas; and Puas-Puasi. While this possibil- 
ity must be kept in mind, we feel that the 
relationships here obtained should be used 
as hypotheses for further investigations, 
especially since factors such as sex of the 
therapist, time in treatment, severity of ill- 
ness, frequency of treatment hours, have not 
been controlled in this study. 

Let us now briefly consider some of the 
differences in perception of the other person 
which exist between patients and their thera- 

ists. 

We find that all patients, except two with 
novice therapists, overestimated their thera- 
pist’s similarity to their ideal (i.e., idealized 
their therapist) while therapists tended to 
underestimate their patients similarity to 
their (the therapists’) ideal. The difference 
between mean Puasi and mean TUASI was 
examined by ¢-test for correlated means and is 
significant at the 1 per cent level (after con- 
version to 2’s). 

A difference significant at the 5 per cent 
level was obtained between the patient’s un- 
warranted assumed similarity between him- 
self and the therapist (Puwas) and between his 
ideal and the therapist (Pwast). This is not 
surprising if we consider that a patient in 
treatment is by implication dissatisfied with 
his present self and prone to feel that the 
person trying to help him is better adjusted 
than he, the patient. 

We indicated previously that patients and 
therapists tend to share the same ideal. This 
was here tested as follows: each of the 76 
statements was summed for the ideal sorts 
of all patients, and similarly for ideal sorts of 
all therapists. A composite scale was then 
constructed by placing the statement with the 
highest mean score for each group in category 
8, the five next highest statements in category 
7, and so on, until the distribution was again 
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the same as in the original instructions to our 
subjects. When we now correlate the com- 
posite patient ideal with the composite thera- 
pist ideal we obtain an r of .81 which suggests 


Patient-THeErapPist Pairs 451 


therapists tend to place more emphasis than 
patients on (@) outward, extroverted orien- 
tation, (4) interest in people and in com- 
fortable interpersonal relationships, and (c¢) 


TABLE 3 


Most anp Least CHARACTERISTIC CoMPosITE STATEMENTS 








CoMPOSITE 
PaTIENt’s IDEAL 


CoMPosITE 
THerapist’s IDEAL 


ComposiTE PaTIENT’s 
PERCEPTION OF THERAPISTS 





Most Characteristic (Category 8) 





I am consistent and dependable in 


my dealings with others. 


I feel fresh, vigorous and ready for 
anything most of the time. 


I have developed a good deal of 
self-control. 





Next Most Characteristic (Category 7) 





I have developed a good deal of 
self-control. 


I have a strong sense of responsi- 
bility about my duties. 


I finish most everything I start. 


I am interested in everything that 
is going on in the world: business, 
politics, social affairs, etc. 


When I wish to arrive at the truth, 


I inake a conscious attempt to 
eliminate sentiment and prejudice. 


I have arranged my life so that it 
runs smoothly and without conflict. 


I am consistent and dependable in 
my dealings with others. 


When I have to act, I am usually 
quick to make up my mind. 

I feel things deeply and personally, 
and am sensitive to the deeper feel- 
ings of others. 

I am interested in everything that 
is going on in the world; business, 
politics, social affairs, etc. 


I have a strong sense of responsi- 
bility about my duties. 

I organize my daily activities so 
that there is little confusion. 

I am consistent and dependable in 
my dealings with others. 


I consider a matter from every 
standpoint before I form an opinion. 


When I wish to arrive at the truth, 
I make a conscious attempt to 
eliminate sentiment and prejudice. 





Least Characteristic (Category 1) 





I do a great many things just to 
avoid criticism. 


I usually lack self-confidence when 
I have to compete against others. 


I often act on the spur of the 
moment without stopping to think. 





Next Least Characteristic (Category 2) 





I am more apt to give in than to 
continue the fight. 


lack self-confidence when 


others. 


I usually 


I have to compete against 


I am afraid of physical pain. 


I find it difficult to exclude irrele- 
vant ideas and pin myself down to 
one line of thought. 

My de sires 

one another. 


are often at war with 


I am likely to enjoy getting a per 
son’s goat. 


I feel nervous and anxious in the 
presence of superiors. 


I avoid very close intimacies with 
other people. 

I do a great many things just to 
avoid criticism. 


My desires are often at war with 


one another. 


I am more apt to give in than to 


continue to fight. 
I have difficulty controlling 


sexual impulses. 


my 


I feel upset if I hear that people 
are criticizing or blaming me. 


I do a great many things just to 
avoid criticism. 


I usually lack self-confidence when 
I have to compete against others. 








that both groups hold essentially the same 
traits as desirable and indicative of adjust- 
ment, that is, that the concept of adjustment 
may be shared in our culture. 

Inspection of the content of the most and 
next most characteristic statements (Table 3) 
in the descriptions of the ideal indicates that 


rational, intellectual control of action. Con- 
sistent with this finding are the statements on 
which widest differences between patients 
and therapists were found: 

“I feel fresh, vigorous, and ready for any- 
thing most of the time.” (Therapists’ ideal 
scale value of 8, patients’ of 6.) 
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“I finish most everything I start.” (Thera- 
pists’ ideal scale value of 5, patients’ of 7.) 

The therapists chose as least desirable (a) 
uncomfortable interpersonal _ relationships, 
especially those involving the expression of 
hostility toward others, (4) feelings of 
guilt and inadequacy, and (c) in addition, 
they bring in the idea of freedom from 
physical symptoms and from obsessive ideas, 
though they do not mention any specific type 
of interpersonal relationships as particularly 
undesirable. The item on which the two 
groups differed most at this end of the con- 
tinuum was “I avoid very close intimacies 
with other people.” (Composite patient ideal 
value of 4, composite therapist ideal value 
of 2.) Thus, in general, these patients em- 
phasize more than do their therapists, an 
absence of internal conflicts and the presence 
of rational controlled behavior; the therapists, 
in turn, value good interpersonal relation- 
ships and a certain amount of emotional 
spontaneity and lack of inhibition. 

With regard to impulsiveness, an interest- 
ing phenomenon occurs: the statement, “I 
often act on the spur of the moment without 
stopping to think,” was rated as follows: 
Patients’ prediction: 1; Patient’s ideal: 4; 
Therapists’ ideal: 4. This is one of the two 
statements where a score difference as great 
as 3 occurred. It seems here as if both groups 
regard this trait as equally neutral for them- 
selves, yet the patients choose it as being the 
most atypical (of the actual) characteristics 
of their therapists. 

The patients apparently were extremely 
impressed by the calm, controlled “thera- 
peutic neutrality” of their therapists, so im- 
pressed that they generalized this trait and 
unconsciously assumed this to be highly char- 
acteristic of the therapists at all times, even 
though they (the patients) as a group, did 
not desire it too highly. 

The other statement wherein a rank dif- 
ference of 3 occurred was “I feel fresh, vig- 
orous, and ready for anything, most of the 
time.” The patients saw this as just barely 
true of the therapists (predictive stereotype 
value of 5); the therapists’ ideal stereotype 
value of this statement was 8. Here again is 
pointed up the therapists’ high value upon 
spontaneity, health, and energy. This the 
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patients saw as present in the therapists, but 
not to the extent which the therapists wished. 

All the evidence thus seems to point to 
one consistent difference between the thera- 
pist and the patient groups, insofar as their 
ideals are concerned. This difference can be 
stated in terms of an emotional continuum, 
defined by impulsiveness at one end and 
rigidity at the other, i.e., therapists value and 
wish for freedom, spontaneity, and lack of 
rigidity more than do the patients; patients 
value and wish for control and stability more 
than do the therapists. This difference is in 
part, at least, a result of different charac- 
teristics of each group. The patients referred 
for psychotherapy at this clinic are, in gen- 
eral, more troubled by emotional impulsivity 
than by emotional constriction. Of the thera- 
pists at this clinic, probably the reverse is true. 
These findings highlight the necessity in this 
type of research for careful selection of 
patients and therapists, in terms of each 
group’s character structure, whenever such 
controls are feasible. However, with this 
reservation in mind, it is interesting to ob- 
serve that the therapists, who are themselves 
in search of greater emotional spontaneity, 
still inspire in their patients a desire for 
greater emotional control and stability. Two 
other factors might be operating here in 
addition to (and possibly more forcibly than) 
the selection factors just mentioned. One 
is the ability of the therapists to “soft-pedal” 
and/or disguise those aspects of their per- 
sonalities which would be distressing to the 
patients, and emphasize, instead, those aspects 
of their personalities which are reassuring 
and inspiring. The other is the overwhelm- 
ing need of the patients to “idealize” the 
therapists, or to endow them with the quali- 
ties which the patients themselves are search- 
ing for. Further investigation of this question 
would be valuable. 


SUMMARY 


Fifteen patient-therapist pairs were asked 
to describe (a) themselves, (4) how they 
would ideally like to be, and (c) how they 
believed the other member of the pair de- 


scribed himself. These descriptions were 
made by means of an array of 76 statements 
which were treated by Q-technique methods. 
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The various correlations and derived meas- 
ures which were obtained were ranked over 
the group of therapists and patients, and 
rank-order correlations were computed. The 
main portion of the study is an attempt to 
objectify and quantify certain aspects of 
therapeutic relationships. The findings of 


this section were presented as hypotheses for 
further research. The trends (and interpre- 
tations) of the data indicative of these atti- 
tudes were considered, as were the nature 
of the therapists’ ideals, the patients’ ideals, 
and the patients’ perception of their thera- 
pists. 
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THE LEARNING AND RETENTION OF WORDS WITH 
ASSOCIATION DISTURBANCES * 
JULIUS LAFFAL 2 


State University of lowa 


INCE Jung first gave the word association 
method a psychoanalytic interpretation, 
its validity in uncovering repressed 


emotional material, or at least in providing 
an index of severity of repression, has been 
generally accepted in clinical work. Its 
efficacy as a diagnostic instrument is assumed 
to lie in: (a) that it employs a procedure 
analogous to the free association method of 


psychoanalysis, and (4) that disturbances in 
association are attributable to anxiety-related, 
repressed * material, or “complexes.” 

Numerous studies, among them those em- 
ploying the method as a guilt indicator (1, 
6, 10, 13) have given support to the idea 
of “complexes.” Luria (10), and Huston, 
Shakow, and Erikson (4) found that subjects 
who, under hypnosis, had accepted self- 
humiliating fictitious accounts as truth, 
blocked on words which referred to the em- 
barrassing events, when tested in the post- 
hypnotic state. Studies by Smith (16) and 
Lynch (11) have pointed out a relationship 
between association disturbances,* emotional 
tone of words, and forgetting. 


1 Part of a thesis submitted in partial fulfillment of the 
requirements for the degree of Doctor of Philosophy in 
the Department of Psychology, State University of Iowa. 
The author wishes to express his sincere appreciation to 
Dr. A. L. Benton for his supervision of this research, to 
Dr. I. E. Farber for many helpful 
Dr. E. F. Lindquist for his advice on the experimental 
design. He is also indebted to Drs. Gustav Bergmann 
and W. W. Morris for their friendly interest 

2Now at the Veterans Administration Hospital in 
New 


suggestions, and to 


ngton, Conn 

has undergone many vicissi- 
within the writings of Freud The 
meanings are reviewed by Rapaport (13) and 
In general, by repression is meant the pre- 


The term “repression” 


tudes alone. 
various 
Sears (14) 
vention of the 
“presentation” (idea, image, feeling, etc.) which, were 
it to become conscious, would give rise to anxiety. The 
for relating association disturbance: in the word 
st to such material may be exemplified by 


even 
emergence into clear consciousness of any 


rational 
association te 
a quotation from Freud: “But the common character of 
the mildest, as well rest case to which faulty 
yntribute lies im the ability to refer 


as the seve 
and chance act 
the phenomena to unwelcome repressed, psychic material, 
which, though pr from is not 


robbed of all capacity to express itself” (2, p. 178). 


ms ¢ 


hed away consciousness, 


It appears to follow from the idea of the 
repression of anxiety-related material that 
words with association disturbances should 
take more trials to the learning criterion in 
a learning task than do words without asso- 
ciation disturbances, since “complex” factors 
will interfere with the normal acquisition of 
disturbing words. One would expect, fur- 
ther, that words with association disturbances 
should be more readily forgotten (repressed) 
than words without association disturbances. 
This expectation is based on the reasoning 
that repression is an active process tcarried 
out over a period of time,> whose net result 
is to reduce the strength of response tend- 
encies for those responses which are anxiety- 
arousing because of their relationship to 
“complexes.” Accordingly, two null hypothe- 
ses were tested in the present study: 

1. There is no difference in the number of 
trials required to learn words with association 
disturbances and words without association 
disturbances to a given learning criterion; 

2. There is no difference in the retention 
of words with association disturbances and 
words without association disturbances over 
a period of time. 


4“Association disturbance” and “complex indicator” 
are used synonymously in the literature of the word 
association test. 

5 The relation of repression to time has not been stated 
explicitly in the writings of Freud. However, Agnes A. 
Sharp gives reasons for believing that such a relationship 
is implied in the repression theory. She says: “In our 
opinion, Freud tacitly assumed that the differential effect 
of hedonic tone upon memory revival will be manifested 
after interval of time, and that this effect 
tends to increase with time. Freud did not assume that 
the memory revival of an unpleasant experience is imme- 
diately inhibited once and for all. Repression is actually 
an after-expulsion. Time is required for the mechanism 
of repression to be effective. In this connection we may 
note that many of the experiments that have been per- 
formed to test the truth of Freud's doctrine measured 
the relative revivability of pleasant and unpleasant items 
after an interval of only a day, and in these cases nega- 
tive results were generally obtained. Such experiments 
cannot be regarded as an adequate test of Freud's theory” 
(15, p. 


only 


some 


395). 
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PROCEDURE 
Experimental Procedure 


Eighty male students in the Introductory 
Psychology and Psychology of Adjustment 
classes at the State University of Iowa were 
used in the experiment. The three basic 
phases of the procedure were: 

1. Word association test, including the 
“reproduction” test. 

2. Learning of twenty words—ten with 
and ten without association disturbances— 
one day after the first part. 

3. Tests of retention of the learned words, 
at various intervals after the second part. 


In the first part of the experiment S was brought 
into the room and seated at a desk opposite E. E 
gave the following instructions: “I am going to 
read a list of words. After each word that I read, 
answer as quickly as possible with the first word 
that comes to your mind. For example, if I say, 
‘table,’ and the first word you think of is ‘chair,’ 
say ‘chair.. Do you understand?” If S indicated 
he did not understand, E repeated appropriate parts 
of the instructions. 

The roo words of the association list, which is 
shown in Table 1, were then read off one at a 
time, and S's response, reaction time, and other 
pertinent data as noted below, were recorded for 
each stimulus-word. Upon completion of the list, 
E gave the following instructions: “I am going to 
present the same list of words again. After each 
word that I read, I want you to try to give the 
same word that you gave the first time. You may 
take as long as you please this time.” The words 
were given again, and note was made wherever S$ 
failed to recall the word responded with originally. 
The reproduction test ended the first part of the 
experiment. In the interim (one day) between the 
first and second parts of the experiment, E selected 
ten stimulus-words with association disturbances 
and ten words without disturbances to be used in 
the learning part of the experiment. In preparing 
the list for learning, words with association disturb- 
ances (disturbing words) were simply alternated 
with words without disturbances (neutral words). 
For half the Ss a disturbing word began the list, 
and for half a neutral word. S was brought back 
to the experimental room one day after the word 
association test. E gave the following instructions: 
“I am going to present a series of pictures each one 
of which is paired with a word. Try to remember 
which words go with which pictures. After I have 
shown you the pictures and the words together, I 
am going to show you the pictures alone, and you 
are to try to give me the words that go with the 
pictures.” 

E exposed the pictures and words paired on 
white cards, allowing five seconds 
The exposure was made by 


, 1 Ee 
single 4” <6 
for each exposure. 
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hand, E dropping each card face down when five 
seconds had elapsed, so revealing the next card. A 
stop watch in continuous operation was used for 
timing. The pictures were small colored stencils 
of the following objects: chick, flower, barn, rooster, 
shovel, apple, pail, boat, kettle, wee, airplane, duck, 
stars, house, ball, fish, wagon, squirrel, water can, 
and pig. The words were in black %” special type 
on 2”x4” cards, and for each S a different set of 
words selected from his word association protocol 
was stapled beside the pictures on the 4” X6” cards, 
so making the picture-word pairs. The sequence 
of the pictures remained the same for all Ss so that 
half the time the pictures were paired with dis- 
turbing words, and half the time with neutral 
words. This of course applied within each experi- 
mental group as well as with the whole group of 
Ss. When the cards had been exposed once, E 
gave the following instructions: “Now I am going 
to show you the pictures alone, and it will be your 
task to give the word that goes with each picture. 
If you cannot remember the word, simply say, 
‘Don‘t know,’ and I will give you the correct word.” 

E then exposed another series of cards on which 
the pictures appeared in the same sequence but 
without the words. Each card was exposed until 
a response was given, no time limit being set. 
However, E offered the correct response if S said, 
“Don’t know,” or gave an erroneous response. 
Each response to a stimulus-picture was considered 
one trial. As S gave correct responses, the cor- 
responding cards were dropped out of the pack, and 
when S had given each response word correctly 
once, the learning period ended, and S was 
dismissed. 

S was not informed that he would be required 
to recall and relearn the words he had learned. 
The total group of 80 Ss was randomly divided 
into four recall groups of 20 Ss each. Each of the 
four groups was brought back for recall and re- 
learning at intervals, respectively, of approximately 
15 minutes (o days), 2 days, 4 days, and 7 days 
after learning. In the case of the o-day group, 
immediately after learning, S was requested to go 
to another room at some distance from the experi- 
mental room, and to wait there. He was given no 
hint that he would be asked for retention. At the 
close of ten minutes E went to this room and 
brought S back to the experimental room. Here 
S was asked to recall spontaneously (without being 
shown the stimulus-pictures) as many of the words 
from the learning task as he could. Five minutes 
were allowed for this “free” recall. S called out 
each word as he remembered it, and E noted the 
words. Following the “free” recall, the cards with 
the pictures alone were exposed one at a time, and 
S was asked to try to give the word which had 
been paired with each picture. Each response was 
considered a trial, and as a picture was responded 
to correctly it was dropped out of the series. The 
remaining cards were again shown to S, and those 
to which correct responses were given were dropped 
out of the pack. This procedure was followed until 
S had given a correct response to each of the 
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twenty stimulus-pictures. It is to be noted that 
the relearning was not preceded by exposure of 
the pictures and words together, as was the case 
in the original learning 

Ss in the 2-day group were asked, at the learning 
session, to report to the secretary of a professor of 
psychology on a given date two days later. When 
S wanted to know the reason for so reporting, he 
was informed that it was in order to procure certain 
information from him pertinent to the experiment. 
Ss in the 4-day and 7-day groups were contacted 
by telephone by E one or two days before they 
were required to report back, and a similar pro- 


cedure was followed. Upon appearing at the pro- 
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million words. Grammatically the list was 
made up of 63 words which could be inter- 
preted both as verbs and nouns, as verbs and 
adjectives, or as all three (bandage, mature, 
limp); 18 nouns (thigh, garage); 14 verbs 
(spill, soothe); and 5 adjectives (stupid, 
jolly). This list of words appears in Table r. 
Association Disturbances and Words Selected 
for the Learning 


The kinds of association disturbances 
which have been suggested or utilized in the 


TABLE 1 


Worp List SHowinc Frequency oF Each Worp Per MILLION IN THE WRITTEN LANGUAGE 








. bust, 13 26. applause, 13 

zero, 11 7. perfume, 21 

muddy, 11 28. fuss, 11 

. upset, 20 

. smash, 12 

. shy, 21 

. orchestra, 10 

. accent, 16 

. slap, 13 

. caution, 15 

. thirst, 23 

. clumsy, 10 

. stool, 16 

. bleed, 16 

. thigh, 13 

e kite, 10 
scrub, 14 » Sar, 17 
tempt, 23 . salute, 22 

. limp, 15 . torture, 25 
barber, 16 . jolly, 22 

. hoist, 10 . tackle, I 

wrinkle, . jerk, 18 

. suicide, 11 


sneak, 10 

punch, 17 

. bandage, 14 
fiction, 14 

. melody, 15 

. hug, 15 

0. crutch, 10 
. dirt, 21 
shove, 15 
. bomb, 13 
. scent, 24 

. ip, 19 

+ Quart, 17 

" 


Aue whe 


ue © on 


w 
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. cork, 11 
50. whine, 11 


. leisure, 19 
7. stammer, 12 
. tangle, 11 

. revolt, 22 

. sap, 18 

. timid, 15 

. ink, 20 

. tame, 25 

. Spit, 13 

. flush, 2 

. sniff, 15 

. dump, 10 

. sweat, 19 

. Virgin, 17 

. jelly, 19 

. garage, 14 
. squeeze, 18 
. perch, 23 

. monkey, 23 
. wink, 20 
. toilet, 11 
. excel, 11 
. scold, 19 
9. mature, I 


51. phone, 20 
52. pinch, 20 
53. grease, 16 
54. sprinkle, 24 
55. darn, 12 
56. insult, 21 
57. debate, 24 
58. tickle, 10 
59. bully, 10 
60. kneel, 19 
61. swear, 21 
62. stumble, 25 
63. lick, 22 
64. pirate, 12 
. suck, 15 
. threat, 17 
. dwarf, 25 
. tease, 14 
. disgust, 21 
. insane, 10 
. sponge, 12 
» Woo, I2 
3, ram, 16 
. spill, 12 


. soothe, 16 100. cripple, 1¢ 


y 
} 





fessor’s office, S was sent by the secretary to the 
experimental room where he was given the reten- 
tion tests. Ss in general appeared to have been 
surprised at being brought back for the retention 
tests, and it may be assumed that there was little 
or no review of the learned material. 


Selection of Words for the Association Test 


In most clinical uses of the word associ- 
ation test, no serious consideration has been 
given to equating the words on familiarity 
to S. Since in the present study S was re- 
quired to learn some of the words of the 
association list, it was deemed important to 
equate the words on familiarity. For this 
reason 100 words were selected from The 
Teacher's Word-Book (17), within the fre- 
of 10 to 25 occurrences per 


quency range 


association test are numerous (17). 
There are certain criteria of disturbance, 
however, which are fairly easily detected and 
recorded, and which have been found to 
have greater reliability than others (3). 
These are: repetition of the stimulus-word, 
apparent misunderstanding of the stimulus- 
word, overlong reaction time, failure to 
reproduce the response word in the repro- 
duction test, and clang reaction. Only these 
disturbances were recorded for each S, and 
in selecting disturbing words for the learning 
task, words with two or more of these dis- 
turbances were given first preference, after 
which words with only a single disturbance 
were selected. In the latter case, the order 
of preference was that given above. 
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The criterion for selection of the neutral 
words was: that word in each successive 


group of ten words in the association test 
which had the lowest reaction time and no 
association disturbances. The two groups of 
words were independently randomly incor- 
porated into the learning list for each S. 


Experimental Design 

For each S three measures were secured. 
These were: 

1. Number of trials in the original learn- 
ing of disturbing and neutral words to the 
criterion of one correct response to each 
stimulus-picture. 

2. Number of disturbing and neutral 
words recalled in the “free” recall period. 

3. Saving score for disturbing and neutral 
words, this being the number of trials re- 
quired for original learning of each type of 
word, minus number of trials required to 
relearn the words.® 

The recall data were treated by analysis of 
variance, the design being considered one 
of treatments (type of word) XintervalsX 
replications. The two main factors in the 
experiment were type of word (disturbing 
and neutral) and the time interval between 
the original learning and the retention tests. 
Since each S was exposed to both types of 
words, the desired comparisons were all 
intrasubject ones. Hence, in the analysis, the 
intersubject comparisons were disregarded, 
and only the intrasubject data were utilized. 

The total variance in the retention data 
was analyzable into the following sources: 

1. Retention intervals. 

2. Error term for between subjects. ‘This 
was made up of the variance due to replica- 
tions, and to the interaction of replications 
and intervals. 

3. Type of word (with or without associ- 
ation disturbances). 

4. Interaction of type of word and reten- 
tion interval. 

5. Error term for within subjects. This 
was made up of the variance due to inter- 
action of type of word and replications, and 
due to interaction of type of word, replica- 
tions, and intervals. 

6 In this paper only the recall score data are reported. 
Recall and saving score trends were alike in almost every 


respect, and all statements which could be made about 
the one type of score applied equally well to the other. 
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The sources of variance pertinent to the 
intrasubject comparisons were type of word, 
interaction of type of word and intervals, and 
error term for within subjects. The effects 
of interval alone, and of the error term for 
between subjects, being intersubject effects, 
were of no concern in the present experi- 
ment. The error term for within subjects 
was the interaction effect of type of word 
subjects (replications) for each interval, 
pooled for all intervals. 


REsuLTs 
Number of Trials 
With respect to number of trials required 
to learn the disturbing and neutral words to 
the criterion, a ¢test for related measures 
(since each S learned both disturbing and 
neutral words), showed that the difference 
was significant between the .05 and .o1 levels, 
disturbing words taking longer to learn. The 
t-test is shown in Table 2. 


TABLE 2 
LEARNING OF DisTURBING AND NeutraL Worps 
(N=80) 








TriatsTo TRIALS TO 
LEARN LEARN MEAN 
DisrurBING NeuTRAL DIFFERENCE 
Worps Worps 





19.06 1.19 
7-57 +54 


M 20.25 
SD 7.52 





Trends in Recall of Words 
Table 3 shows the mean recall of words 
with and words without association disturb- 
ances at the various time intervals after 
learning. 
TABLE 3 
RecaLtt or Distursinc AND Neutrat Worps 


(N for each interval=20) 








Worps with Worps wirHout 
ASSOCIATION ASSOCIATION 
DisrurBANcEs DisTURBANCES 


TIME 


INTERVAL 





-45 6.70 
.02 1.27 
.40 .95 
-53 .69 
30 .05 
-66 -28 
-35 -20 
.56 .60 
63 -48 
-24 -40 


o Days 


2 Days 


mes 
oxox 


Days 


7 Days 


ZSESE 


All groups 


NWF N HN HN 


ny 
— 
~ 
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The hypothesis tested was that the popula- 
tion recall values of the means for words with 
association disturbances and for words with- 
out association disturbances were identical 
for each interval of recall. This fundamental 
hypothesis was divided into two others, each 
of which was tested separately. 

The first subordinate hypothesis was that 
the differences in values of the population 
means for words with association disturb- 
ances and words without association disturb- 
ances were the same (not necessarily zero) 
for all This test of the 


hypothesis of “no interaction” between the 


intervals. was a 
two trends. 

Referring to Table 4 it may be seen that 
the F ratio between the interaction mean 
square and the error term mean square was 


The hy- 


insignificant, being less than one. 


pothesis of “no interaction” was therefore 


retained. 


TABLE 4 


ANALysis OF VARIANCE oF RecaLt Scores 


SouRCcE SS df Ms P 





Type of word +90 I 
Interaction 

(Type of word 

X interval) 2.60 


Sever term 118.50 





The second subordinate hypothesis regard- 
ing the trends was that the mean of the dif- 
ferences in population recall means of the 
two types of words for all intervals was zero. 
This a test of the difference between 
the general means for disturbing and neutral 


was 


words, the F ratio being formed between the 
mean square for type of word and the mean 
square for the error term. Reference to Table 
4 shows that this F was also less than one 
and hence insignificant. 

On the basis of the two F 
above, the hypothesis that the population re- 


tests described 


call values of the means for words with asso- 
ciation disturbances and for words without 
asssociation disturbances were identical for 
each interval of The 
trends for recall of disturbing and neutral 
words did not differ significantly. 


recall was retained. 
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Trends in Recall of Disturbing and Neutral 
Words with Equal Numbers of Trials to 
the Learning Criterion 


For each S the five disturbing words and 
the five neutral words that had taken the 
smallest number of trials to learn were se- 
lected. The means of the number of trials 
to the learning criterion for these groups of 
words, for the twenty Ss in each recall inter- 
val, are shown in Table 5. 


TABLE 5 


Triats To Learn Easy-ro-Learn DisturBING AND 
Easy-To-Learn NeutraL Worps 
(N for each interval=20) 








NEUTRAL 
Worps 


DisTURBING 


INTERVAL “ 
Worps 





o Days M 7.05 -05 
SD 3.40 .62 
2 Days M -70 .60 
SD 1.31 83 

Days M 6.10 .00 
SD 1.67 -24 

7 Days M 6.05 -10 
SD 1.36 -4! 

All groups M 6.23 -19 
SD 2.17 -14 





From Table 5 it may be seen that easy-to- 
learn disturbing words and easy-to-learn neu- 
tral words took approximately the same num- 
ber of trials to reach the learning criterion. 
Table 6 shows the mean recall of words in 
each of the categories, and Table 7 con- 
tains an analysis of the difference in trends 
of recall of easy-to-learn disturbing words and 
easy-to-learn neutral words. The F ratio be- 
tween the mean squares for interaction and 
error term was 1.28, which for 3 and 76 de- 
grees of freedom was insignificant. The F 
ratio between mean squares for type of word 
and error term was less than one and hence 
insignificant. The analysis in Table 7 dem- 
onstrates that where disturbing and neutral 
words had approximately equal numbers of 
trials to the criterion there was no difference 
in recall trends for the two types of words. 


Discussion 


Examination of the initial learning scores 
for disturbing and neutral words showed a 
significant difference in the number of trials 
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TABLE 6 


Recatt or Easy-To-Learn DisturBiNG AND Easy-To- 
Learn Neutra Worps 
(N for each interval=20) 


Easy 
NEUTRAL 
Worps 


Easy 
DisTURBING 
Worps 


Time 
INTERVAL 





o Days 2.50 2.95 
«92 1.02 
2 Days ] 1.30 1.10 
1.05 1.09 
.85 .85 
-79 -73 
7 Days ] -90 .80 

+70 .98 
All groups } 39 -43 

.09 31 


4 Days 





TABLE 7 


ANALYSIS OF VARIANCE FOR RECALL oF Easy-To-LEARN 
DisturBING AND Easy-To-LEARN NeuTRAL Worps 





SOURCE SS df MS F Pp 





Type of word .06 I .06 -09 n.s. 


Interaction 
(Type of word 
X intervals) 2.47 3 .82 1.28 


48.98 76 -64 


Error term 





required for the learning of the two types 
of words, the disturbing words requiring 


more trials than the neutral. This finding 
was in accord with repression theory explana- 
tions of disturbances on the word association 
test. 

Recall and saving scores for the two types 
of words showed no significant differences 
for any interval of recall. Since disturbing 
and neutral words differed in number of 
trials to the learning criterion, an attempt 
was made to examine the relation between 
number of trials to the criterion, and the re- 
tention scores. Comparison of the easy-to- 
learn disturbing and easy-to-learn neutral 
words, which required approximately the 
same number of trials to reach the initial 
learning criterion, showed no difference in 
the retention of these two types of words.’ 

7 Comparison of the recall of the five disturbing and 
five neutral words which were the hardest to learn for 
each S, showed that the disturbing words were in general 
better recalled, but not significantly so. The better recall 
of the hard-to-learn disturbing words followed clearly 
from the learning situation. Since these were the words 


that took the greatest number of trials to learn, they 
were the last learned in the learning series. Conse- 
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The results suggest that the repression fac- 
tor may have affected the original learning, 
but not the retention of disturbing words. 

The original hypothesis that there would 
be differences in retention as well as in learn- 
ing for the two types of words was based on 
reasoning such as Sharp’s (15) that repres- 
sion is an active process which takes place 
over a period of time. This involved the idea, 
in the present study, that the relative 
strengths of response tendencies for the two 
types of words existing at the close of learn- 
ing would change with the passage of time. 
Such changes could come about, in the case 
of the disturbing words, as a result of 
spontaneous recovery of incorrect competing 
responses, or as a result of anticipatory repe- 
titions of the learning situation due to stimu- 
lus generalization, with reinforcement of the 
incorrect (anxiety-avoidant) responses. 

If the original hypothesis of this study, that 
retention differences for disturbing and neu- 
tral words would appear with time, was justi- 
fied, clear retention differences should have 
been obtained, at least for the 7-day group. 
The failure to obtain such differences raises 
the possibilities (@) that the anxiety associ- 
ated with the disturbing words was reduced 
to impotence in the learning trials, or (4) 
that it was not a major factor to begin with. 

Anxiety associated with the disturbing 
words could have been reduced in the learn- 
ing task by the occurrence of a strong rein- 
forcement on the anxiety-related but correct 
response. If the effect of such a reinforce- 
ment was to reduce permanently the anxiety 
associated with the correct word, this word 
would have tended to reappear quickly in the 
retention tests since the competing responses 
due to anxiety would have been eliminated 
with the anxiety. 

In considering what else beside anxiety 
could have resulted in differences in learn- 
ing between disturbing and neutral words, 
the responses made by all Ss to the disturb- 
ing and neutral words which had been used 
in the learning were examined in detail. 

Table 8 contains a tabulation of the words 
which appeared most frequently among the 





quently, they were the best retained. Had there been 
any reversal of this expected retention trend, there would 
have been ground for assuming that the inherent nature 
of the disturbing words themselves was affecting the 
retention. 
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TABLE 8 


Worvos Wuicu Appearep E1cHt orn More Times AMONG THE DisTURBING AND THE Neutrat Worps FoR 











NUMBER OF 
DIFFERENT 
ASSOCIATIONS 


FREQUENCY AS 
DisTuRBING 
Worp 


DisTURBING 
Worps 


NUMBER OF 
DirFERENT 
ASSOCIATIONS 


FREQUENCY AS 
NEUTRAL 


Worps Worp 





50 
50 
50 
42 
25 21 
23 31 
50 
30 
19 
3I 
31 
40 
15 26 
15 24 
15 30 
14 40 
13 4! 
13 36 
13 47 
12 2 
12 2 
12 33 
II 33 
It 34 
11 41 
13 31 
It 34 
19 
23 
28 
29 
45 
29 
35 
35 
27 


tempt 
threat 
insult 
disgust 
hug 
limp 
tease 
shy 
woo 
bust 
coward 


37 
31 
30 


17 
17 
I5 
15 
upset 15 
whine 
phone 
clumsy 
tangle 
jerk 
lick 
scold 
scent 
caution 
grease 
bully 
insane 
soothe 
tumid 
cripple 
hiction 
bleed 
ram 
squeeze 
snc ak 
crutch 
wrinkle 
accent 
pinch 
excel 
torture 39 


spill 28 


stumble 
garage 
shove 
rip 
hoist 
dwarf 
swear 
sniff 
darn 
punch 
zero 
stupid 
thirst 
kite 
suicide 
sprinkle 
woo 
timid 


31 II 
29 10 
27 10 
25 7 
24 23 
19 16 
18 25 
18 20 
17 21 
16 28 
15 19 
15 18 
14 9 
14 22 
14 26 
14 19 
14 19 
14 31 
muddy 13 24 
melody 13 9 
ink 13 18 
sponge 12 21 
leisure 12 30 
monkey 12 20 
slap 11 29 
cork II 15 
grease II 33 
wink 11 15 
squeak 10 25 
perfume 10 15 
stool 10 17 
phone 10 24 
debate 10 18 
sap 10 26 
scrub 9 14 
fuss 9 35 
orchestra 9 12 
stammer 14 
bleed 23 
salute 26 
ram 28 
virgin 18 
squeeze 29 
mature 16 





1324 
39 





in associations 


33-95 


888 
44 
20.18 


Total 
N 
Mean associations 





* For each of the 80 Ss ten disturbing and ten neutral 
each type. 


ten disturbing words for each S and among 
the ten neutral words for each S. For all 80 
Ss there were totals of 800 disturbing and 800 
neutral words used in the learning and re- 
tention The table accounts for 572 
of the disturbing words and 6or1 of the neu- 
tral words. This table also contains a tabula- 
tion of the number of different *® responses 


tests. 


8 Responses which contained the same root but dif- 


fered in grammatical form were considered the 


1 
ony 


words were used in the learning, making totals of 800 of 


to these words made by the 80 Ss in the word 
association test. 

There is a striking difference between the 
two types of words in the number of different 
associations the 80 Ss gave to them. Most Ss 
gave the same responses to the neutral words, 
but considerable variety appeared in the re- 
sponses to the disturbing words. If the situ- 





same response. Thus, “fall,” “falling,” and “fall down,” 
were considered the same response. 
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ation for all Ss together could be taken to 
represent roughly the situation for the aver- 
age S, then the implication is that for the 
neutral words there were relatively strong 
dominant response tendencies, while for the 
disturbing words there were no single dom- 
inant response tendencies. The availability 
of a dominant, conventional response would 
have led to a smooth response when the stim- 
ulus-word was given in the word association 
test. However, the lack of such a dominant 
response would have led to a disturbance in 
the response because of the competition of 
response tendencies of equivalent strength. 
The learning difficulty of the disturbing 
words may also have been a reflection of the 
unavailability of dominant conventional as- 
sociations to these words. Lack of such con- 
ventional associations may have made it diffi- 
cult for S to make mnemonic connections 
between the words and the stimulus-pictures. 
On the other hand, in the case of the neu- 
tral words, the availability of common asso- 
ciations may have made it easier for S to 
utilize mnemonic techniques, permitting him 
to learn these words faster. This analysis 
of the linguistic features of the word associa- 
tion test does not carry the implication that 
retention differences for disturbing and neu- 
tral words should also have been expected. 

In the light of the above arguments, further 
study is required to determine whether dis- 
turbances on the word association test are 
due to the presence of “complex” or anxiety 
factors associated with the disturbing words, 
or to linguistic characteristics of these words. 


SUMMARY 


The Jung modification of the word asso- 
ciation technique, which is the prototype of 
that used in the present day clinical prac- 
tice, is based on the assumption that “com- 
plex” indicators or association disturbances 
on the test indicate that the stimulus-words 
concerned are in some manner associated 
with repressed, anxiety-related material. In 
accordance with repression theory one would 
predict that stimulus-words with association 
disturbances would be more difficult to learn 
and more readily forgotten than words with- 
out such disturbances. 

The hypotheses tested in the present study 
were: 
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1. There is no difference in the number of 
trials required to learn words with association 
disturbances and words without association 
disturbances to a given criterion. 

2. There is no difference in the retention 
of words with association disturbances and 
words without association disturbances over 
varying periods of time. 

Eighty college males were given a 100-word 
association test. Ten words with association 
disturbances, and ten words without associ- 
ation disturbances were selected for each S, 
and these words were arranged in a list, each 
word being paired with a picture. S learned 
to give the word in response to a presentation 
of the picture. Separate groups of 20 Ss each 
were called back at intervals of 15 minutes, 
2 days, 4 days, and 7 days after learning, and 
asked to recall as many of the learned words 
as they could in a free recall period lasting 
five minutes. Following this, each S was 
required to relearn the paired associates. 

It was found that disturbing words took 
significantly more trials to learn than neu- 
tral words. The data suggested that there 
was no difference in retention of disturbing 
and neutral words which could be attributed 
to the disturbance factor. 

The repression explanation of disturbances 
on the word association test was supported 
by the finding that disturbing words took 
significantly longer to learn than neutral 
words. Possible explanations of the failure of 
clear retention differences for the two types 
of words to appear were: 

1. The anxiety associated with the disturb- 
ing words, which was responsible for the 
initial difference in learning, was perma- 
nently reduced in the learning, and hence 
did not influence the retention measures. 

2. Anxiety was not a major factor in pro- 
ducing competing responses in the original 
learning of the disturbing words. 

Further research on the nature of the dif- 
ferences between disturbing and _ neutral 
words is required. 


REFERENCES 


1. Crostanp, H. R. The psychological methods of 
word-association and reaction-time as tests of de- 
ception. Publ. Univer. Oregon, Psychol. Series I, 
1929, 1-104. 

2. Freup, S. Psychopathology of everyday life. In 
A. A. Brill (Ed.), The basic writings of Sigmund 
Freud. New York: Modern Library, 1938. 








462 Jutrus Larrar 


3. Hutt, C. L., & Lucorr, L. S. Complex signs in 
diagnostic free association. ]. exp. Psychol., 1921, 
4, 111-136. 

. Husron, P. E., SHaxow, D., & Erickson, M. H. 
A study of hypnotically induced complexes by 
means of the Luria technique. j. gen. Psychol, 
1934, 11, 65-97. 

. Junc, C. G. Studies in word-association. M. D. 
Eder (Trans.), London: Heineman, 1918. 

. Junc, C. G. The association method. Amer. J. 
Psychol., 1910, 21, 219-269. 

. Kons, S. C. The association method in its relation 
to the complex and complex indicators. Amer. ]. 
Psychol., 1914, 25, 544-594- 

3. Linpeuist, E. F. Goodness of fit of trend curves 
and significance of trend differences. Psycho- 
metrika, 1947, 12, 65-78. 

. Linpguist, E. F. Notes for a course in design of 
experiments in education and psychology. State 
Univer. of Iowa (Mimeo.), 1950. 

. Luria, A. R. The nature of human conflicts. W. H. 
Gantt (Trans.), New York: Liveright, 1932. 


Lyncn, C. A. The memory value of certain alleged 
emotionally toned words. ]. exp. Psychol., 1932, 
15, 298-315. 

. Rapaport, D., Girt, M., & Scuarer, R. Diagnostic 
psychological testing. Chicago: Year Book Pub- 
lishers, 1946. V. IL. 

. Rapaport, D. Emotions and memory. Baltimore: 
Williams and Wilkins, 1942. 

. Sears, R. R. Functional abnormalities of memory 
with special reference to amnesia. Psychol. Bull., 
1936, 33, 229-274. 

. SHarp, Acnes A. An experimental test of Freud's 
doctrine of the relation of hedonic tone to memory 
revival. J. exp. Psychol., 1938, 22, 395-418. 

. Smirn, W. W. The measurement of emotion. New 
York: Harcourt, Brace, 1922. 

. Tuornoike, E. L., & Lorce, 1. The teacher's word- 

book of 30,000 words. New York: Columbia 

Univer. Press, 1944. 


Received by this JournaL, August 10, 1951. 





MORAL BEHAVIOR AND MORAL JUDGMENT OF CHILDREN 


REFIA UGUREL-SEMIN 2 


’ 
Uniwersity of Istanbul 


being is subjected to the influence of a 

social environment as well as a physical 
one. This social environment not only obliges 
him to recognize certain facts, but also sup- 
plies him with a language and a sense of val- 
ues and, at the same time, imposes rules upon 
him with which he must endeavor to com- 
ply within the limits of his abilities. 

The most profound study carried out on 
the development of moral judgment in the 
child is that by Piaget who has endeavored 
to interpret the child’s concept of moral 
rules (1, 3). 

From studying the rules of play and those 
of moral judgment pertaining to stories, 
Piaget postulates two mora! systems: (a) 
coercive pressure exerted by rules, and (4) 
free cooperation and respect for the rules 
based upon mutual understanding. Piaget 
did not deal with moral judgment in the 
light of a moral act, but he proposed the 
hypothesis that the child would demonstrate 
moral judgment in action far earlier than in 
speech. 

The writer felt that by placing the child 
in a moral situation less delicate than either 
lying or stealing, for example, the experi- 
mental method might be used to test the 
hypothesis. Using children of both Geneva 
and Istanbul, the writer examined the de- 
velopment of the concept of physical law 
governing the seesaw principle and the un- 
derstanding of that principle, as well as the 
mutual relationship between development 
and understanding (4). The study showed 
that understanding of the physical environ- 
ment is acquired in light of experience and 
that there are types of thought characteristic 
of the intelligence in its developmental stages. 
Similarly, the social environment leads to in- 
teractions between the developing individual 
and his immediate circle. 

Within this framework an attempt was 


Fi: the ume of his birth every human 


1 The writer wishes to express her gratitude to Dean 
E. R. Hilgard for reading the manuscript of this paper 
and for his editorial help in preparing it for publication. 
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made to answer the following questions in 
regard to generosity: (@) What are the types 
of moral behavior as related to age? (6) Is 
there a difference in moral conduct according 
to sex? (¢) What effect does the socioeco- 
nomic group to which the child belongs have 
upon his moral behavior? (d) What effect 
does belonging to a family of one, two, or 
more children have upon the moral behavior 
of the child? (¢) In what way is moral be- 
havior related to moral judgment? (f) 
What are the existing types of moral judg- 
ment, and what is their evolutionary process? 
(g) What similarity is there between the 
adaptation of the mind to the physical en- 
vironment and to the social environment? 
(A) What relationship exists between moral 
judgment and moral action? 


Experimental Procedure 


The experiment required a child to divide 
an unequal number of nuts between himself 
and another child. Different pairs of chil- 
dren divided from § to 15 nuts. 

The study was begun in 1942. Two hun- 
dred ninety-one children between the ages of 
4 and 16 attending the kindergarten and 
primary schools of Istanbul were subjects (Ss) 
in the experiment. Children are normally 
admitted to primary school at 6 to 7 years 
of age, and they leave at 11 to 12 years. The 
few Ss older than 12 were those who started 
school rather late. 

In each operation of the experiment, two 
children were taken into a room and seated 
at a table, with the experimenter (E) sit- 
ting opposite. The nuts were placed on the 
table and subject A was told that he was to 
share them with subject B. But, before A 
began to speak, subject B was requested to 
leave the room and wait outside until called. 
Subject A was then asked how he was going 
to share the nuts with B upon the latter’s 
return. 

Subject B was then called in, and A was 
made to perform the sharing act and be 


ee ee 


eneeh omen anata 
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questioned in front of B. The following is an 
example of a typical procedure: 


A (6 years, 1 month) and B (5 years, 10 months) 
are sitting at the table. The E places g nuts on the 
table and says, “A, you see the nuts? You are going 
to share them with B. B, will you leave the room 
for a moment and wait outside until I cail you.” B 
goes out. “A, how are you going to share them?” 
A replies, “Half for me and half for him.” B is 
in and sits down. “B has come back,” E 
remarks. “What are you going to do?” “I will give 
him half and keep half for myself.” He counts the 
nuts. “There are four for me and four for him, if I 
leave this one out.” “What are you going to do 
now?” E asks. “T'll leave it out. Four for him and 
four for me.” “But,” E suggests, “instead of leav- 
ing it out, put it where you like.” First A adds it 
to his pile, then brings it near to B’s pile and touches 
it. Finally he keeps it for himself. “Is that all 
right?” he asks. “I don’t know; perhaps it is,” he is 
told. “I know that it’s all right like this,” A says. 
“Why is it all right?” “Because it is a sharing as 
between brother and sister.” 


called 


Some children took as long as 10 minutes 
to decide how to divide the nuts; others 
shared them equally, requesting the experi- 
menter to keep the remaining nut for a sister 
who was at home. And there were some 
who carefully felt each one in turn to make 
sure that they picked the best for themselves 
or their companions. 


REsuLts 
What are the types of moral behavior as 
related to age? 
Considering the moral conduct of children 


within the limits of the experiment, three dis- 
tinct categories will be found (Table 1). Of 
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TABLE ! 


Morar Benwavior oF Susyects REGARDLESS oF AGE, 
Sex, Economic Stratus, or Famiry Size 


Groups ee 
e N 


/o 





Those who give less (selfish) 14 40 
Those who share equally (equalitarian) 123 
Those who give more (generous) 44 128 

Total 291 





the total number, only 40 (14 per cent) may 
be classified as selfish, keeping more than 
they give, whereas those who are generous 
or share equally are of higher and almost 
the same percentages (44 per cent and 42 
per cent, respectively). 

Concerning the change in moral behavior 
with maturation, it is apparent that the 
selfish tendency diminished with age (Table 
2). Between the ages of 4 and 6 years, this 
tendency appears to be at its zenith (66 to 67 
per cent), diminishing noticeably until it 
completely disappears at 12 years. There- 
fore, from the findings in this experiment, we 
can say that children act selfishly until 6 
years of age. 

Generosity increases after 5 to 6 years and 
reaches 63 per cent at 7 to 8 years, at which 
period the child is the most apt to be gen- 
erous. After this age the tendency fluctuates 
somewhat but tends to remain below equali- 
tarian decisions. 

It is interesting to note the coincidence of 
the age of generosity with the decline in 
egocentrism. Thus, it is at the same age 


TABLE 2 


FREQUENCY 


GENEROI 


N 


or Generous, EQUALITARIAN, AND SELFISH CuHorces AccorDING To AGE 


EQUALITARIAN 


SELFISH 


N NuMBER OF CASES 
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that the child rids himself of egocentrism in 
the realm of thought and of selfishness in 
the realm of moral behavior. 

The tendency to give an equal share 
evolves progressively until 11 to 12 years 
where it reaches 68 per cent. It dominates all 
other tendencies at most ages beyond 8. 

Figure 1 shows, beyond points discussed 
above, a particular phenomenon beginning at 
II to 12 years. From this age onward the 
tendency toward equality and generosity in 
sharing alternate constantly: sometimes gen- 
erosity is of a high percentage and at other 
times the tendency to share equally predom- 











N. Pree 


465 


ence in moral behavior between the groups 


could be found (Table 3). 


TABLE 3 


Morar BeHavior Accorpinc To SEx 








Boys Girts 


Groups 





Generosity 
Equality 
Selfishness 


Total 




















9-10 


Fic. 1. 
Generosity 





Equality 


inates, although the small numbers at any one 
age make interpretation precarious. Accord- 
ing to Piaget’s (1) findings, the building up 
of “opérations formelles” begins at 11 to 12 
years and is more fully formed during adoles- 
cence. At this age, too, the moral behavior 
of the child is shaped by concepts of justice 
and goodness. Freed both of egoism and of 
deference to one-sided moral pressures, the 
child is able to judge the moral situation 
from different aspects. He bases his judg- 
ment upon an abstract idea, law, upon a 
common ideal shared by the society in which 
he lives, or upon the concept of goodness and 
love. This matter will be given further 
discussion in the section of this report deal- 
ing with the development of moral judgment. 


Is there a sex difference in moral conduct? 


For this experiment subjects were almost 
equally divided on the basis of sex. No differ- 


1o-t1 


i2 2-13 13-4 4-55 


FREQUENCY OF GENEROSITY, EQUALITY, AND SELFISHNESS ACCORDING TO AGES 


Selfishness 


What effect does the socioeconomic group to 
which the child belongs have upon his 


moral behavior? 


One hundred sixty-seven of the 291 sub- 
jects were classified into three groups: rich, 
middle-class, and poor. Since kindergartens 
are private in Turkey, the children attending 
them were of families able to pay for such 
schooling, hence could not be included in this 
phase of the experiment. 

Socioeconomic level of the family affected 
the behavior of the children in this experi- 
ment. In general, the poorer children were 
as often generous as the rich, more often 
equalitarian, and less often selfish (Table 4). 

The significance of the relationship between 
socioeconomic status and degree of gener- 
osity was determined through application of 
the chi-square technique. A chi-square of 20.5 
was found, with 4 degrees of freedom, the 
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TABLE 4 


Mora. BEHAVIOR 


NUMBER OF Ricu 


© 


Groups 
SUBJECTS 


ACCORDING TO 


Sratus 


Economic 


Mipp.e-Cass 


N 


° 





62 
14 
24 


Generosity 
Equality 
Selh 


nness 


Total 


19 
13 
II 


44 
30 
26 


43 





relationship being significant at better than 
the .o1 level. The magnitude of relation- 
ship as expressed by the coefficient of mean 
square contingency (C) is .33.° The equiva- 
lent r is .40. 


What effect does belonging to a family of 
one, two, or more children have upon the 
moral behavior of the child? 


Subjects were divided into three groups: 
those from families of one child, two children, 
and more than two children. 

Reference to Table 5 will show that the 
child of the larger family is slightly more 
often generous than the only child. This 
trend applies also to sharing equally (17.3 
per cent, 21 per cent, and 37 per cent, re- 
spectively). The relationship is inverse for 
selfishness (30.4, 17 and 8 per cent, respec- 
tively). 

Chi square in this instance is 11.89, with 
4 degrees of freedom, the relationship being 
significant between the .o2 and .o1 levels. 
C=.26; the r-equivalent is .30. 

In order that the factor of family size be 
stabilized in the analysis of effect of socio- 
economic status, children of one- and two- 


) 


»y the formula, 


( = 
‘—¢2 


2C was « 1 directly | 
= 


€ P hi square was computed as N 
P 


child families were eliminated, and the re- 
lationship of generosity and socioeconomic 
status was studied in the large-family chil- 
dren. 

The findings in this phase of the study 
(Table 6) are essentially the same as those 
reported above in a discussion of the relation- 
ship of socioeconomic status and moral be- 
havior in the entire sample. The same rela- 
tionship among children of large families is 
significant between the .o2 and .o1 levels. 
C=.35, and the equivalent r=.42. 

Generosity is therefore not only influenced 
by membership in a large family, but by the 
socioeconomic status of the family as well. 
It may thus be concluded that the moral 
behavior of children in this experiment is no- 
ticeably determined by the three following 
factors: 

1. The age of the child. 

2. The socioeconomic conditions in which 
the child is living. 

3. The number of children in the family 
group to which the child belongs. 


In what way is moral behavior related to 
moral judgment? 


Among children in the current study who 
shared equally and those who were generous 
there was some consistency between moral 
conduct and moral judgment. Among those 


TABLE 5 


Mora 


Groups NuMBER OF One Cuitp 


N 


SUBJECTS 


Benavior Accorpinc To Famiry Size 


Two CHILDREN 


N 





Generosity 
f qu ality 


Selfishness 


95 
49 


22 
23 


167 











Morat Benavior aND Mora JupGMENT OF CHILDREN 


who may be classified as selfish, however, two 
types of moral judgment can be distin- 
guished: (a) approval of one’s own selfish 
behavior and (4) disapproval of it. 

Among the 4o selfish children, 10 per cent 
abstained from pronouncing moral judgment 
upon their own behavior. Thirty-two per 
cent of the equal-sharing children and 9 per 
cent of the generous also declined to com- 
ment. 

Those children who approved their own 
selfish behavior may be grouped into four 
categories: 

1. Purely selfish attitude: “I gave less to 
have more for myself.” (5 years, 6 months.) 

2. Fixation in numerical perspective: The 
child understands the act of sharing in one 
way only. He gives less “because one is 
missing” (4 years, 8 months); or, “if you 


in behavior guides the moral thought, which 
seeks justification for the conduct. 

Those children who disapprove of their 
own selfish behavior may be categorized as 
follows: 

1. Vague feeling of disapproval of selfish 
behavior: The child feels that it is not good 
to keep more for himself, but cannot give 
a clear reason for doing so. “Is it right to 
do that?” “No.” “Why?” “What is the fair 
thing to do?” (5 years, 5 months; 7 years, 
10 months.) 

Sometimes this feeling of disapproval is 
shown by gestures. The child hesitates a 
long while before deciding the fate of the 
odd nut. Having felt each nut in turn and 
found them all equally good, he adds the re- 
maining one to his own share. (5 years, 4 
months; 8 years.) 


TABLE 6 


fora BEHAVIOR OF CHILDREN OF FAMILiEs OF THREE OR More CuiLpren Accorpinc to Economic Stratus 








NUMBER OF Ricu 


Groups - - 
SuBJECTS N 


Mippve-Crass Poor 
% N 





Generosity 51 


8 
Equality 34 3 
Selfishness 7 4 

5 


Total 92 I 


46.1 6 
38.5 5 
15.4 2 


100.0 13 





subtract 4 from 9g, there are 5 for me.” (7 
years, 5 months.) 

3. Confusion in point of view: The child 
cannot distinguish his point of view from 
that of the other child. “If he wants, I will 
give it to him. If he doesn’t, I shall take 
it.” The E remarks, “He isn’t saying any- 
thing. What should be done?” And the child 
replies, “It means that he doesn’t want it.” 
(7 years, 4 months.) 

4. Sociomoral precepts employed contrary 
to normal usage: The rules dictating altruis- 
tic moral behavior, such as “sharing between 
brother and sister,” “the older one gives 
more,” “the younger one gives more,” “the 
one receiving the nuts must share equally,” 
are employed by these children for a selfish 
aim. The moral precept is chosen to fit the 
accomplished act. 

The children in the first category judge 
their selfish action as being fair. In the 
fourth category to the contrary, selfishness 


2. Insufficiency of number: The child tak- 
ing the larger share acknowledges that it is 
wrong, but justifies the action by the insufh- 
ciency of the number of nuts. “Is it fair to do 
that?” the E asks. “No.” “Why?” “Be- 
cause I gave him less and took the most.” 
“What is the fair thing to do?” “To give 
him the same amount.” “So what shall we 
do?” And the child responds, “We will do 
it this way, because there are not many.” 
He keeps 4 for himself and gives 3 to the 
other child. (6 years, 4 months.) 

3. Acknowledgment of certain moral rules: 
The child says, “I must keep less and give 
more.” Nevertheless, he keeps more for him- 
self. (7 years, 11 months.) A boy of 8 thinks 
it “unfair to keep more for himself.” A little 
girl thinks that “it would be shameful and 
unfair to keep more for myself.” Yet she 
acts selfishly just the same. (6 years, 11 
months.) 

Those who disapprove of their selfish be- 
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havior demonstrate progress, in that they The types of judgment can be classified into 
show unilateral respect for the rules. Those 32 subgroups, as follows: 
who approve of their selfishness, on the other 
hand, demonstrate an incapability of judging 
a moral situation in the light of the view- Sonfu: 
point of others. The common factor to both ™4terial with the moral. 

Mee ge 7 athe rk 2. Fixation in numerical perspective. 
groups, of course, is a selfish tendency which 3. Insufficiency of number of nuts. 
dete: mines behavior. 4. Purely selfish attitude. 


I. Egocentrism 


1. Confusion of others with self; confusion of the 


What are the existing ape of moral judg- II. Sociocentrism (obedience to moral and 
ment, and what is their evolutionary religious rules and customs) 


process? . 
5. You give more to your friend. 


In each moral action group there is a cer- 6. The older one gives more (or an equal 
tain kind of moral judgment. Some moral amount). 


TABLE 7 


RELATIONSHIP OF Type oF Mora JUDGMENT To AGE AND Morat BeHavior 


SELFISH 





MepbIAN AGE GENEROUS EQUALITARIAN 








Types or Morar JUDGMENT - ; ‘ ~ 
Years Montnus N 7 N 





Egocentrism 3 3 25 
Sociocentrism 2 53 82 
Awareness of social reaction 3 13 93 
Superficial reciprocity 5 10 32 
Deeper and enlarged reciprocity; coopera- 

thon 5 It 44 
Altruism 5 27 84 
Justice 10 ee + 

Total 119 





judgments are peculiar to the behavior group 7. The younger one gives more (or takes more). 
8. It is necessary to offer to guests. 


to which they belong, while others are non- 

i -* ud lensied 9. It would not be fair to keep the bigger share. 
specific. If these judgments oS Cee 10. He who is given the nuts to divide must give 
according to type, without consideration of most. 
the action involved, a distinct curve in the 11. If you eat in front of another you must give 
evolution of moral judgment can be ob- (share equally). : 

-_ - 12. Sharing between brother and sister. 
served (Table 7; Figure 2). 13. Boys give to girls. 
14. Girls give to boys. 
15. Generosity is a good action before God. 
16. Unkindness should be replied to with a kind 


act. 
17. One should always think of others, of the 


poor. 
III. Awareness of social reaction 
18. Fear of public opinion; shame. 


IV. Superficial reciprocity 





| | 19. Materialistic point of view. 
20. Sharing between brother and sister as expres- 


j= L = 
t u " iv sion of equality. 
21. Equal sharing is the best. 


Fic. 2. Ace at Wuicu Eacu Type or Morat Jupc- 22. Equal sharing is considered necessary. 


2. 


MENT PREDOMINATES: MEDIAN AGE IN YEARS AND 23. Unequal sharing makes the child uncomfort- 
Montus (See Table 7). able. 
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V. Deeper and enlarged reciprocity; coopera- 
tion 
24. Mutual interest. 


25. Preventing a friend frora being angry. 
26. Maintenance of good relations between friends. 


. Altruism 


. Gift (sacrifice). 
§. Mutual help. 
29. Sympathy. 
. Moral joy. 
. Idealistic altruism. 


. Justice 
. Justice and right demand equal sharing. 


is important to examine each of these 
seven major types of moral judgment. 
Egocentrism. The confusion of self with 
others is displayed in two ways: either the 
child attributes all his desires to the other 
child, being unable to differentiate between 
two selves, or he identifies himself so com- 
pletely with the other child that he cannot 
distinguish between their respective wishes. 
“Who did you give the bigger share to?” “To 
B.” “Why?” “So that she wins.” “Why do 
you want her to win?” “So that she'll get 
bigger and won’t have to stay in kindergarten 


any longer. If she stays here, she can’t get any 
bigger. When she grows up she'll buy lots 


of things. She'll be an officer.” (4 years, 
8 months.) 

Likewise, the confusion of the material 
with the moral is manifested in two ways: 
the child thinks that his generosity will have 
a beneficial effect upon an important event 
in his life; he sees a cause and effect between 
two facts which are actually unrelated, such 
as the generous action and passing an exam- 
ination. Or the motivation stems from the 
child’s attributing a special power to the 
“look.” In acting generously, he believes that 
he will avoid the bad effect of the look of the 
person who watches him eat. “Now my 
friend’s eye will not remain on the nuts.” 
(10 years, 6 months.) 

In the fixation in numerical perspective, the 
child can see the problem of sharing only in 
one way: that which is beneficial to him. 
This is not only a matter of selfishness, how- 
ever, but an inability to work out the arith- 
metic problem in any other way. 

In the insufficiency of the number of nuts, 


the attitude closely resembles that of the pre- 
vious subgroup. The child claims that there 
are not enough nuts to share as he would 
wish, whereas in the previous situation the 
child insisted that a nut was missing or that 
he could see no other way of sharing them. 

The purely selfish attitude is evident when 
the child frankly takes the larger share in 
order to have more to eat. 

Sociocentrism. In this group can be found 
the expression of unilateral deference. The 
child accepts the suggestions of his environ- 
ment, to which he accommodates himself 
without criticism. He recognizes the pres- 
sures exerted by custom and moral and 
religious rules. Thus a rule may sometimes 
be cited as cause for sharing equally, some- 
times for generous behavior and at other 
times for selfishness. The same verbal ex- 
pression may be used as a basis for different 
kinds of sharing. Its use depends on the 
attitude of the particular child. 

In an examination of sociocentrism in 
moral judgment, it is striking to note the 
complexity and variety of the pressures 
exerted upon the child. The thirteen differ- 
ent rules which form subcategories in this 
area emanate from customs, moral rules, and 
religious attitudes. 

Table 8 gives the frequency of each type 
of rule as the basis for a moral judgment. It 
will be seen from the totals at the bottom of 
the table that equal sharing is least influenced 
by any rule (5 per cent), since equality is the 
very expression of reciprocity. Next comes 
the selfish group (14 per cent). And the 
generous children comprise the group most 
influenced by rules (81 per cent). 

Among all the children citing a moral rule, 
the greatest pressure appears to be exerted by 
the schoolmate group (28 per cent). Respect 
toward older persons also plays an important 
part in the child’s life. The attitude toward 
older children (22 per cent) and younger 
ones (9 per cent) is proof of this fact. Next 
come the sociomoral pressures, such as the 
idea of being generous to guests (8 per cent)*, 

8 It was found that the children considered themselves 
as though they were receiving guests. During the present 
study, the pupils of another class joined the class from 
which Ss were taken. Therefore, two children belonging 


to two different classes took part in many of the experi- 
mental situations. The child dividing the nuts acted 
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believing generosity necessary regardless of 
the circumstances (9 per cent), acceptance of 
the idea that the action of dividing obliges 
the child to act with generosity or equality 
(6 per cent), acceptance of the idea of sharing 
as between brother and sister (5 per cent). 
The rule, “If you eat in front of another you 
must give,” emanating directly from a social 
custom, does not appear to influence these 
children greatly (3 per cent).* Purely moral 
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pressures, such as “Unkindness should be 
replied to with a kind act” and “One should 
always think of others,” appeared to influence 
only 3 per cent of the children. And the 
influence of religion and politeness is the 
least (1 per cent). 

It will be noted that the child is subjected 
to different social pressures in a given order, 
proportionate to the possibility of contact 
that the child has with certain groups. The 
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greatest pressure is exerted by the environ- 
ment to which he is Thus, the 
schoolmate group is the closest to the child 
and exerts the most influence. The pressure 
exerted by the adult group takes second place, 
being followed by society, the widest group. 
In relation to the child, these groups become 
concentric, the one nearest to him occupying 
the central position. Unilateral moral and 
religious influences and politeness appear last. 

Awareness of social reaction. In examin- 
ing feelings in this category, it will be noted 
that they are an aspect of the concentric pat- 


closest. 


tern mentioned above rather than being rep- 
resentative of a separate period. When the 


This social custom springs from the idea that he who 


} 


hers are eating will be unhappy and 
that his desire for it will make him ill. Sometimes it 1s 
believed that “the eye of him who watches you cat will 
remain upon you.” This “look” may badly affect the 
person eating. To counteract the effect, you give to 
those who watch you eat. 

In old districts of Istanbul elderly women may be seen 
who keep their provisions in a nontransparent sack 
that another's eye may not affect them. 


cannot buy what ot 


so 
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child responds in conformity with a moral 
rule understood in a unilateral sense, he feels 
that an external power approves or disap- 
proves his conduct. Fear and shame are 
emotional factors in respect for the moral 
rule: “It would be shameful to keep the 
bigger share.” “Why?” “Because that would 
be greedy.” (12 years, 7 months.) Social 
reaction, then, plays the part of one’s con- 
science. Its place will be taken by the moral 
conscience when the individual reaches moral 
autonomy. 

Superficial reciprocity. It is only from the 
age of g years, 5 months that one notices a 
sentiment of reciprocity in children, but it 
remains superficial in the beginning. The 
child no longer judges moral action only 
from his own viewpoint or that imposed by 
society, but now becomes capable of judging 
it from two points of view: that of the person 
doing the sharing and that of the person 
receiving. But moral appreciation remains 
within these limits and does not extend to 
the motives. 

Reciprocity is observed in the act of equal 
sharing when the child’s understanding is 
based upon “sharing between brother and 


sister,” “equal sharing is the best,” “equal 


sharing is necessary,” “unequal sharing 
makes the child uncomfortable.” 

In the materialistic point of view, the child 
not only considers the sharing act from two 
sides, but also foresees the effect of his moral 
action from a strictly materialistic angle. 
Thus, the child will give more to ensure that 
the other child eats more. Or he considers 
that the effect of half a nut upon physical 
growth is insignificant, hence acts generously. 
Again, he thinks that the stomach of the 
bigger person has a greater capacity and 
therefore gives him more. Another reason 
put forward for generosity is the impossi- 
bility of dividing one nut equally. 

Deeper and enlarged reciprocity; coopera- 
tion. The child becomes capable of foresee- 
ing the effects of his action in the light of 
social necessity, and he behaves accordingly. 
He can look back upon a past action which 
produced a certain effect upon others. The 
child makes the moral decision in conformity 
with the principle of mutual interest: “My 
friends are always useful (good) to me; so 
am I to them” (7 years, 6 months). He will 


prevent a friend’s being angry: “If I take 
more, he will be angry” (10 years, 6 months). 
He will do his utmost to maintain good 
relations between friends: “It is for good 
friendship that I am giving him more. I 
give them to him so that there is no quarrel” 
(13 years, 8 months). To cooperate becomes 
the very expression cf social necessity, and 
cooperation here requires generosity. It is 
by this freely accepted rule that children 
cooperate among themselves, and it can be 
said with Piaget that the moral rule is a 
condition and at the same time a result of 
cooperation in this period of autonomy. 

Altruism. With the different forms of 
altruism, understanding extends to the 
motives of the moral action and to S’s state 
of mind. Altruism may find expression in 
motives such as sympathy: “I give it to him 
to show him how much I like him” (12 
years). The subject may make a sacrifice 
out of respect for the sentiments of others: 
“If I take a lot his feelings will be hurt” (11 
years). Understanding of the subject’s state 
of mind may find expression in sympathy 
and moral joy: “When I give only a little to 
my friend, it upsets me. I will eat only a 
little; it makes me happy” (11 years, 6 
months). 

Justice. Justice may embody either of two 
kinds of relationship: a temporal one, in 
which the present is linked with the future, 
or a relationship between a concrete situation 
and an abstract idea. In the former relation- 
ship, judgment stems not only from a realiza- 
tion of the present moral situation, but also 
from recognition of the temporal relationship 
of the present situation to future events. 
Altruistic idealism offers an example. The 
child will behave generously to his friend, 
because they will work for the same ideal 
when they are older: “When you give more, 
rights are not respected. Only, when we are 
older, we will work for the same country. 
We will work for the same aim” (10 years, 
5 months). 

In the second type of altruistic relationship, 
the child sees his concrete moral action in 
light of abstract principles of justice and 
ight: “No more, no less; we take what is 
our right.” 

The progress of moral thought is charac- 
terized by five different tendencies whose 
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common trait is found in the change from 
centralization to decentralization. 

Moral thought moves (a@) from external 
consideration of the moral situation at hand 
toward internalization of moral understand- 
ing, (6) from being linked to the present 
moment toward consideration of life as a 
whole, (c) from a consideration of a specific 
connection to a linking up of various con- 
nections, (¢@) from an individual and per- 
sonal consideration of the moral action 
toward reciprocity and cooperation, and (e) 
from unilateral consideration of the moral 
rule toward its mutual understanding. (The 
latter two traits have been clearly explained 
by Piaget.) 

An examination of these five traits of the 
development of moral thought reveals their 
underlying common trend from centraliza- 
tion to decentralization. During the first 
phase, centralization is evident in th rela- 
tionship of the self to its environment. Since 
the child is unable to differentiate between 
the self and others, egocentrism is the be- 
havioral thought pattern. Fixation in nu- 
merical perspective is not only an intellectual 
inability to find different solutions for a 
given problem, but demonstrates as well that 
the subject is bound by an egotistical tend- 
ency. Thus, at the age of 7 years, 3 months 
the child bases his moral judgment upon a 
relationship which is the result of a centrali- 
zation of thought in the activity itself. 

In a second phase, sociocentrism is observ- 
able because the moral judgment of the child 
is the reflection of the pressures to which he 
is subjected. The child obeys rules imposed 
by society without being able to discuss them. 
Thus, moral thought is centralized in social 
pressures. 

Later, a decentralization occurs in the 
child’s view of the self, of his own activity, 
and of the moral rule understood in a uni- 
lateral sense. The appearance of reciprocity 
shows that the child is able to distinguish 
his point of view from that of others, that he 
can decentralize his thoughts from the self. 
At first, this reciprocity is superficial and 
materialistic. The foreseeing of the effects 
of his moral action shows that the child is 
capable of looking back in order to adjust 
his moral action in the present. This looking 
back makes a greater reciprocity possible: “My 
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friends are good to me; I am good to them.” 
“They are useful to me; I am also useful to 
them.” To live peaceably, to avoid making 
others angry or jealous, briefly, to maintain 
good relations with others, the child will act 
generously or share equally. The moral rule 
of acting generously or sharing equally be- 
comes a social necessity which the child 
accepts of his own accord. 

The internalization of the motives of 
moral thought and the freeing of the indi- 
vidual from the immediate state of the moral 
situation are decisive moments in the decen- 
tralization of the self. As a result, moral 
judgment follows from an integration of 
different viewpoints. 


What similarity is there between the adapta- 
tion of the mind to the physical environ- 
ment and to the social environment? 


According to the findings of Piaget (2), 
the adaptation of the mind to the physical 
environment goes through the following 
stages, after the period which is characterized 
by the setting up of motor intelligence: 


1. From the first signs of speech (1 year, 
6 months to 2 years) in a period which lasts 
until 4 years, a symbolic and preconceptual 
thought is developed. 

2. Between 4 and 7 or 8 years, a kind of 
intuitive thought is established, distorted by 
egocentrism, S’s viewpoint being relative to 
his own action and not decentralized into an 
objective system. 

3. From the ages of 7 or 8 until 11 or 12, 
certain concrete mental operations are organ- 
ized in the following fashion: the intuited 
relations of the system under consideration 
are at a given moment suddenly organized. 
This organization is accomplished _ first 
through the manipulation of the objects. 
These children cannot reason with purely 
verbal propositions. Reasoning about real- 
ity consists in an interiorized activity which 
is additive, multiplicative, and reversible 
(decentralization). 

4. Formal thought is elaborated from 11 
or 12 years throughout adolescence. The 
child becomes capable of reasoning according 
to formal operations, that is, reasoning by 
pure assumptions without the necessity of 
relation to objects or to the beliefs of the sub- 
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ject, but by relying on the logical necessity of 
the reasoning process itself. 

A comparison of the adaptation of the 
mind to the social environment with Piaget's 
periods summarized above reveals, first, that 
the adaptation of the mind to either the 
physical or the social environment is charac- 
terized by the change from centralization to 
decentralization. 

From the age of 4 to 7 or 8 years, one finds 
in the development of moral thought the 
same characteristics of thought in its adap- 
tation to the physical environment; that is, 
the thought is centralized on a particular 
state of the object and a particular viewpoint 
of the subject. Only, according to Piaget, 
“the climax of egocentrism coincides in de- 
velopment with that of social constraint and 
the opinion of society” (2, p. 192). However, 
according to the results of the present study, 
the climax of social pressure shows itself 
much later (g years, 2 months). 

This disagreement may be attributable to 
the fact that Piaget made a study of theoreti- 
cal judgments, whereas the present study was 
concerned with practical judgment. Accord- 
ing to the hypothesis of Piaget, practical judg- 
ment precedes theoretical judgment. If this 
hypothesis is correct, the age of autonomy in 
theoretical moral judgment will appear much 
later. To clarify this point, it is proposed to 
continue the present experiment on a verbal 
level with the same social group. 

Further, if this disagreement has developed 
because the two studies were made in differ- 
ent social strata, in which different social pres- 
sures are exercised, a repetition of the present 
study in a different social group should reveal 
differences in behavioral characteristics. 

Nevertheless, it would appear necessary to 
distinguish two phases in the evolution of 
moral thought: that during which the child 
is unable to separate the self from that of 
others, his attention being claimed by a par- 
ticular aspect of the object, and that during 
which the child, while rid of egocentrism, 
blindly accepts social pressures. Thus, there 
are egocentrism and sociocentrism. And in 
the adaptation of the mind to the social 
environment there is something more than is 
seen in its adaptation to the physical environ- 
ment. Sociocentrism plays an important part 
in the development of the personality. 
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Piaget’s period of concrete operations in the 
understanding of things may be compared 
with superficial reciprocity, the materialistic 
point of view, and the maintenance of good 
relations between friends. The child is 
capable of recalling the recent past and fore- 
seeing the effects of his moral action. 

In abstract idealism, in justice and moral 
joy may be found the qualities of Piaget’s 
opérations formelles, because the child, reach- 
ing complete detachment from the momen- 
tary state, is able to judge the moral situation 
from an abstract viewpoint, and the moral 
conscience is constituted. 


What relationship exists between moral judg- 
ment and moral action? 


Two types of approval of a moral action 
by the child may be noted in the findings of 
the present study. First, approval may 
emanate from the child’s ignorance of the 
subjective value of his viewpont. Centrali- 
zation in the self and upon things are due to 
the very structure of his thought. On the 
other hand, the existence of a certain way of 
seeing things which is peculiar to the child’s 
social group may reinforce or attenuate this 
way of thinking. This type of approval has 
been observed in young children who ap- 
prove their selfish action. 

The second type of approval emanates from 
recognition of the value of the moral rule. 
This was found in children who disapproved 
their selfish action or approved their generous 
or equalitarian behavior. 

Disapproval of the chosen moral conduct 
is observed in children who, while aware of 
the moral value of their conduct, still behave 
selfishly. Thus, disapproval is the expression 
of conscious realization of the disagreement 
between the moral rule and the moral 
conduct. 

Considering now the evolution of the rela- 
tionship between the acticn and the moral 
thought, a movement of liberation will be 
seen. 

In the beginning, egocentric action prevails, 
and judgment is subordinate to its exigency. 
This relationship is demonstrated by the 
inability of the child to judge the moral situ- 
ation from any viewpoint but his own. In 
this first phase, the child is not free to deter- 
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mine his conduct, because his intellectual 
maturity is not sufficient to enable him to 
make the determination. 

In the second place, the child’s moral 
thoughts are more egotistical than egocentric, 
and he is not free to direct his action by them. 
He understands the value of the moral rule 
as a social constraint, but does not obey it. 
Here, therefore, the egotistical tendency, 
combined with unilateral understanding of 
the rule, limits moral action. But progress 
has been made in that the child no longer 
colors reality by his egocentrism. 

In the third phase, the child is not yet free 
to direct his action, because the moral rule 
understood in a unilateral sense has the upper 
hand. It is here that the climax of social 
pressures should be placed. In this case, 
moral thought directs conduct, but only by 
way of social constraint. 

In the fourth phase, the child begins to 
direct his action himself. Owing to an ever 
increasing exchange of moral and intellectual 
values, he becomes capable of ridding himself 
of centralizations on the self, on a particular 
state of the object, and on the moral rule in 
its unilateral understanding. Thus, from this 
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moment, the child is able to envisage in a 
harmonious whole what emanates from the 
self, from the physical, and from the social. 
In other words, the child can place himself 
among other individuals and other points of 
view. Society no longer exercises deforming 
constraint upon the individual, but stimulates 
the freedom of his activities. This work of 


the freeing of moral thought is completed 
with the internalization of moral appreci- 
ation, detachment from the momentary state, 
reciprocity, and a connecting up of view- 
points. Thus, moral autonomy is constituted. 
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THE VALIDATION OF THE PAROXYSMAL VECTOR OF THE 
SZONDI TEST *? 


GEORGE D. GOLDMAN 


Franklin D. Roosevelt Veterans Administration Hospital 
Montrose, New York 


His study is an experimental approach 

to the validation of one part of the 

Szondi test, its paroxysmal vector. In 
the construction of this instrument, pictures 
of epileptics and hysterics were used as the 
stimulus material for this vector, and Deri 
(2) and Szondi (5) have said that a subject’s 
choices of liking or disliking these photo- 
graphs reflect that subject’s methods of emo- 
tional control. If this is true, the choice 
reactions of epileptic, hysteric, and seizure- 
free neurological patients should differ sig- 
nificantly from each other. The epileptics 
and hysterics are used as experimental groups 
since they should, according to Szondi’s 
theory, have different methods of controlling 
their emotions. The neurological group, 
consisting of seizure-free brain tumor pa- 
tients are used because of their close associa- 


tion clinically with epileptics and also as a 
control since they did not have seizures. 


BacKGROUND 


The literature on the Szondi paroxysmal 
vector is extremely sparse. Fosberg (3) stud- 
ied the changes in the choices on the paroxys- 
mal vector of patients receiving an electrically 
induced grand mal seizure. He found that 
Szondi’s theory of a decrease in selection of 
specific cards, along with a discharge of ten- 
sion, was not substantiated. However, Rapa- 
port (4) describing the results of using the 
test at the Menninger Clinic had reported 
that “an epileptic just before a convulsion will 
usually have a high number of choices in the 
epileptic column; whereas after convulsions 
few or no epileptic pictures will be chosen.” 
Deri (2) writes that the test is “particularly 
apt to follow up and make visible certain 

1 Reviewed in the Veterans Administration and pub- 
lished with the approval of the Chief Medical Director. 
The statements and conclusions published by the author 
are the result of his own study and do not neces- 
sarily reflect the opinion or policy of the Veterans 
Administration. 


2 Based on a paper read at the 1951 meeting of the 
Eastern Psychological Association. 


psychodynamic changes such as the psycho- 
logic changes during the paroxysmal cycle 
of epilepsy.” In light of this controversy in 
the clinical findings the following problem 
suggested itself as a means of validating 
the paroxysmal vector. 


Tue ProspLem 


In Deri’s (2) manual for the Szondi Test, 
the epileptic or e factor and the hysteric or 
hy factor have been interpreted together un- 
der the heading of the Paroxysmal Vector. 
The ¢ factor theoretically refers to the con- 
trol and discharge of aggressive energy, while 
the Ay factor is related to the control and 
discharge of needs for affection and warmth. 
When actual subjects from the disease entities 
pictured on the test are given the test, and 
their choices are compared to a control group 
of seizure-free neurological patients, the 
Szondi test’s paroxysmal vector, if valid, 
should differentiate between the groups. In 
other words, the three groups should have dif- 
ferent choices of liked and disliked pictures 
and, therefore, different modes of reaction 
on the test. 


PRocEDURE 


To test this hypothesis six consecutively adminis- 
tered Szondi tests were given to each of 78 white 
male veterans of World War II. These 78 subjects 
were broken down into three groups which were 
equated as to age, education, and duration of illness. 
Twenty-six of the patients were diagnosed as hav- 
ing idiopathic epilepsy, grand mal, 26 were diag- 
nosed as having seizures of a hysterical nature, and 
a like number were seizure-free neurological pa- 
tients. These diagnoses were established for each 
case by the neuropsychiatrist in charge of that case 
using complete medical facilities available at the 
hospital. For example, all of the epileptic patients 
had abnormal EEG’s while none of the hysterics 
had abnormal EEG findings. The neurological 
patients all had cortical brain tumors and no history 
of seizures. As a further control each group con- 
sisted of the first 26 consecutive patients referred to 
the author for psychological evaluation who met 
the criteria of the study. 

The data were evaluated as follows: The pattern 
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likes and Cislikes was scored as modes of reac 
tion for each of the six administrations of the 
Szondi test. The most frequent pattern of reac- 
tions was then used to characterize the individual. 
For example, if a subject was plus e¢ on four ad- 
ministrations, minus ¢ on one, and open ¢ on the 
last administration, he was characterized as a plus e¢. 

In this way, each subject’s predominant factorial 
reaction was established as being either plus, minus, 
open, or ambivalent on each of the eight factors. 
The frequencies of each of these various modes of 
reaction were then established on all eight factors 
for each experimental group. Examination could 
reveal how many patients in the various groups had 
as their characteristic mode of reaction a plus ¢, 
a minus ¢, an open ¢, or an ambivalent ¢. Then in 
an effort to discover what the chance distribution 
of a group’s modes of reaction would be, probabil- 
ity statistics worked out specially for the Szondi test 
y Cohen (1) were employed. If it is assumed that 
chance governed the choices for the liked and dis- 
liked Szondi pictures, then 8 of the 26 patients in 
each experimental group would be characteristically 
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RESULTS 


The comparison of the obtained choices 
of likes and dislikes for the three experi- 
mental groups and the choices expected by 
chance distribution indicates that the Szondi 
pictures were not of neutral stimulus value. 
Table 1 shows the results of the chi-square 
analyses where the choices on only nine of 
the 24 factors did not differ significantly from 
chance. Included in the nine factors whose 
distribution of likes and dislikes were not 
different from chance, were the choices of 
epileptic photographs by all three groups. 
Thus, the factor loadings on the ¢ factor of 
the Szondi for these experimental groups 
were more or less randomly distributed. 

The evaluation of the differences in inci- 
dences of loadings for the various factors 


TABLE 1! 


p-Vacurs OsTAIneD IN THE EVALUATION OF THE DIFFERENCES IN INCIDENCE OF THE OsTAINED LoapINes 


AND THEORETICAL LoADINGS ON THE E1cHt SZonpI 


Factors FOR THE THREE EXPERIMENTAL Groups 











hy 


e 


P d 


m 


k 





.90>p>.8o 
1 >p>.10 
>p>.10 


-0o1>p 
.oo1>p 
.0o1>p 


-O5 >p>.02 
-10>p>.05 
-05>p>.02 


.0o1>?p 
.0o1>p 
.oo1>p 


2 


-05>p>.02 
-05>p>.02 
-10>p>.05 


-0o1>p 
-05>p>.02 
-05>p>.02 


-50>p>.30 
.20>p>.10 
-70>p>.50 


-05>p>.02 
-30>p>.20 
-o01>p 





* The following abbreviations will be used in all 


Seizure Group; BD—Brain Diseased Group. 


plus ¢, 8 would be minus ¢, 6 open ¢ and 4 would 
be ambivalent ¢. Using a chi-square analysis, this 
chance breakdown of the groups’ modes of reaction 
was compared with their actual incidences of load- 
ings on each of the eight Szondi factors. This 
was done for all three experimental groups. Next, 
the degree of association between scoring category 
loading and clinical group membership was eval- 
uated through a chi-square analysis, in an effort to 
discover which specific Szondi factors differentiated 
the three groups. 

As an additional feature of the study an attempt 
was made to determine the value of giving six to 
ten administrations as Deri (2) recommends. The 
following procedures were employed: Since Deri (2) 
had said that evaluation of changes in factorial re- 
actions or loadings was extremely important in 
differentiating normal from pathologic behavior, 
changes within each scoring category for the con- 
secutive administrations were noted. Then the 
groups were compared as to type and number of 
changes in factorial reactions or “factor loadings.” 
Also, the “factor loading” of the initial administra- 
tion was compared with the modal or most charac- 
teristic “factor loading.” 


tables: 


IE—Idiopathic Epileptics Group; HS—Hysterical 


among the three experimental groups (see 
Table 2) indicated that the pattern of likes 
and dislikes was only significant on one of 
the eight Szondi factors, the s or Sadistic fac- 
tor. Following the test rationale in inter- 


TABLE 2 


p-VaLturs OBTAINED IN THE EVALUATION OF THE 
DIFFERENCES IN INCIDENCE OF LOADINGS FOR 
THE E:cuHt Szonpt Factors AMONG 
THE EXPERIMENTAL GROUPS 








Factor p-VaLUE 





-50>p>.30 
-05>p>.02 
-10>p>.05 
-95>P> .-90 
.20>p>.10 
-30>p>.20 
.80>p>.70 
-50>p>.30 
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Tue ParoxysMAL VECTOR OF THE SZONDI 


preting the meaning of this finding, it was 
noted that the difference in s factor loadings 
reflected one of the clinical differences be- 
tween the groups, in that the epileptics and 
brain tumor patients had strong aggressive 
needs which they consciously accepted, while 
the hysterics repressed their aggressive drives. 
However, the finding that the groups were 
differentiated in terms of their aggressive 
needs is only of secondary importance in this 
study. Since the groups are presumed to be 
different clinically in their manner of con- 
trolling their aggressivity, and since, accord- 
ing to the Szondi’s rationale, the epileptic 
and hysteric should be most sensitive to the 
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SUMMARY AND CONCLUSIONS 


The present report describes an attempt to 
validate the paroxysmal vector of the Szondi 
test through an experimental approach. The 
choice reactions of epileptics, hysterics, and 
seizure-free brain tumor patients were com- 
pared. Also Deri’s suggested administrative 
procedure was evaluated. 

Based on the results of this study: 

1. The rationale used in interpreting the 
Paroxysmal Vector appears to be highly 
questionable. 

2. It seems reasonable to conclude that there 
is little or nothing to be gained from the 
present method of using from six to ten con- 


TABLE 3 


NuMBER OF 


Patients IN Eacn ExperrmentaL Group Wuo Have THE Various Types oF CHANGES 


In Szonpt Factor Loapincs 








hy 
b’ 


Pp m 
b’ b’ 





13 4 3 12 
HS 2 14 14 


179 5 18 5. 215 


3 
7 17 8 6 19 § 6 3 26 9 
217 9 4 21 4 I 3 5 15 9 


5 
BD 11 4 23 13 12 





a’=The number of patients in each group who have a change in Szondi factor loading from plus (+) to 
minus (—) or minus to plus for consecutive administrations. 

b’=The number of patients in each group who have other types of change in Szondi factor loadings for con- 
secutive administrations of the test. 

c’==The number of patients in each group who do not have any change in Szondi factor loadings for con- 


secutive administrations of the test. 


e and Ay photographs, one would expect sub- 
jects in the three groups to pick different pho- 
tographs as liked and disliked. The fact that 
their choices did not differentiate the groups, 
especially on the e¢ and Ay factors, leaves the 
validity of Szondi’s paroxysmal vector open 
to question. 

With respect to the secondary problem of 
investigating Deri’s suggested administrative 
procedure the following results emerge. The 
modes of reaction of the initial administra- 
tion turned out to be the same as the most 
frequently obtained factor loading for 68 of 
the 78 subjects. Furthermore, changes in the 
factor loadings for consecutive administra- 
tions (see Table 2) did not differentiate the 
groups in any way. 


secutive administrations. One administra- 
tion of the test seems sufficient. 
Further experimental validation studies are 
necessary to clarify the rationale underlying 
the other Szondi vectors before one can use 
the interpretations of this test with confi- 
dence. 
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A CLINICAL STUDY OF SET IN INTRASERIAL LEARNING’ 


ALVIN R. MAHRER 
The Ohio State University 


tncE Jacob’s (2) extension and elabora- 

tion of the digit span method, the anal- 

ysis of errors has been approached from 
at least two points of view: that found in 
clinical testing and that found in experimen- 
tation on intraserial learning. 

In clinical testing, Rapaport (4) said that 
“it is generally advisable to watch for the type 
of errors occurring in repetition of digits” (4, 
p. 180). He related various types of errors to 
such concepts as anxiety, failure of attention, 
disorganization. In discussing the relation- 
ship between the score on the digit span 
test and the depth of the maladjustment, 
Rapaport said that “impairment is not neces- 
sarily indicative of the depth of the malad- 
justment, but rather refers to a specific dis- 
turbing factor or factors which may be present 
in any kind of clinical or normal case” (4, p. 
182). 

Another method of analyzing the perform- 
ance of clinical patients on digit span tests 
is that used in Mitchell’s study (3). She 
analyzed the anticipatory errors found in the 
repetition of digits by 213 mental hospital 
patients and 80 nonhospitalized patients. They 
were asked to repeat digits orally until they 
failed three series of any given length. Mitch- 
ell concluded that the anticipatory errors 
were alrnost identical for the two groups of 
subjects. 

The present study attempts to analyze digit 
span errors by utilizing the concept of set. 
In his critical review of the concept of set, 
Gibson (1) treated it as a “mode of appercep- 
tion . . . a perceptual habit developed by past 
experience.” The concept of set can be 
related to the digit span type of learning, 
since, in past experiences with digits, the 
digits have been perceived not as discrete 
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entities, but as organized in a definite, learned 
sequence. Since this sequence has been 
strongly overlearned, it is believed that when 
an individual is asked to repeat a group of 
digits, this overlearned sequence constitutes 
“a perceptual habit developed by past ex- 
perience,” i.e., a set. 

If the digits in a digit span series were 
3> 45 5, 6, 7, 8, 9, it is likely that many persons 
would repeat the series correctly. One reason 
for this is that the set developed by past 
experience and the correct repetition would 
coincide. However, if the digit span series 
was 3, 4, 5, 6, 8, 7, 9, the set would hinder the 
correct repetition of digits since there would 
be some tendency to alter the digits to con- 
form to the overlearned sequence (e.g., to 
change 3, 4, 5, 6, 8, 7, 9, to 3, 4, 5, 6, 7, 8, 9). 

There are at least two factors which de- 
termine the strength of this tendency. One 
factor is the amount of previous experience 
with digits; with no previous experience 
there would of course be no set. Another 
factor is the degree to which the digits of 
the digit span series depart from the over- 
learned sequence. 

Among the other factors which might in- 
fluence an individual’s digit span performance 
is that investigated by Rapaport and by 
Mitchell, i.e., “depth of the maladjustment.” 
Rapaport compared clinical and normal 
groups in terms of the digit span score. He 
found that a lowered digit span score “is 
mainly indicative of the presence of anxiety,” 
but since “anxiety is a common feature of 
most clinical and normal groups, this lowered 
Digit Span score is generally not a diagnostic 
indicator” (4, p. 193). Using the score on 
the digit span test, Rapaport thus concluded 
that “depth of the maladjustment” was not a 
differentiating factor. In her study of antici- 
patory digit span errors, Mitchell seemed to 
support Rapaport’s conclusions by her find- 
ing that the group of hospitalized mental pa- 
tients and the group of nonhospitalized pa- 
tients did not differ significantly. 

The effect of “depth of the maladjustment” 
will also be investigated in the present study. 
On the basis of Mitchell’s finding it seems 
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likely that the formation of the set to per- 
ceive digits in terms of a learned sequence 
or order is not significantly related to “depth 
of the maladjustment.” 

Hypotheses. 1. As a series of digits ap- 
proaches a one-through-nine sequence, the 
errors will tend to structure the series to con- 
form to the one-through-nine sequence. Since 
there is believed to be no relationship be- 
tween the “depth of the maladjustment” and 
the formation of the set, we would expect 
to find no evidence to support the following 
hypothesis: 

2. T' re will be a significant difference 
between a group of mental hospital patients 
and a group of hospital attendants in their 
tendencies to structure digit span series to 
conform to the learned sequence. 

Subjects. Two groups of subjects were used. 
The first group consisted of twenty mental 
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centages of interjudge and intrajudge agree- 
ments ranged from 92.5 to 97.5 per cent. 
Table 1 indicates that the patient and attend- 
ant groups were closely matched in these 
important variables. 

Materials. Fifty-two series of digits were 
used. These and the instructions were re- 
corded and presented individually to the sub- 
jects. The instructions were: 


Many people say that they would like to have 
a better memory. We are going to give your 
memory an easy exercise that may improve it. 
I am going to say some numbers. When I am 
finished, I would like you to repeat them. For 
example, if I say 7-1-9, you would say (pause) 
7-1-9. Mariy people find that as they continue 
they are able to remember the numbers much 
better than when they began. About three seconds 
before each series, a buzzer will sound so that 
you will be ready for the next series of numbers. 
Here is an example: first the buzzer, then the 


TABLE 1 


MATcHING ON AMOUNT OF Previous ExpeRIENCE wiTH NUMBERS 








Group 


EDUCATION 
MEAN SD 


OccuPATION 


Hicu MEDIUM 


Low 





Patients 
Attendants 





hospital patients at the U. S. Veterans Hos- 
pital at Chillicothe, Ohio. The other group 
consisted of twenty hospital attendants. 

Since it was possible that the effects of 
“institutionalization” might have been found 
among some of the attendants who had been 
employed at the hospital for many years, 
the attendants were selected from a group 
of those most recently hired; all were given 
a battery of psychological tests before being 
accepted for employment. 

The two groups were matched on variables 
believed to be highly correlated with amount 
of past experience with numbers. These 
variables were age, education, and occupa- 
tion. The first two were measured in number 
of years; the third variable, occupation, was 
rated on the basis of judged degree of use 
of numbers in the subject’s previous occu- 
pations. A description of each job and the 
length of time at each job were secured for 
each subject. These job descriptions were 
rated by three raters in terms of high, 
medium, or low use of numbers. The per- 


numbers. (Bzzz) ... 
... All right. 


7-1-9. Do you understand? 
Let’s begin. 


A foot pedal attached to the recorder, and 
operated by the examiner, enabled the subject 
to proceed at his own pace. 

The series varied in length from four to 
nine digits, with no duplication of digits 
within any single series. The fifty-two series 
were of three kinds: 18 predominantly as- 
cending series, 18 predominantly descending 
series, and 16 random series. The predom- 
inantly ascending series had a general upward 
directionality, ic. the highest digit was in 
the last third of the series, and the lowest 
digit was in the first third of the series. The 
predominantly descending series were defined 
as those in which the highest digit was in 
the first third of the series, and the lowest 
digit was in the last third of the series. The 
digits of each random series were selected 
from a table of random numbers. 

If the subjects had been given all of the 
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predominantly ascending series, followed by 
all of the descending series, it is likely that 
they would have developed an “ascending” 
set on the basis of the order of presentation 
of the first 18 series. To avoid this, the order 
of presentation of the three types was de- 
termined from a table of random numbers. 

The predominantly ascending and descend- 
ing series had a general upward and down- 
ward directionality respectively. This means 
that certain digits in each series were out of 
sequence. If these digits that were out of 
sequence occurred at the same location in 
each series (e.g. at the third and sixth digits), 
it is likely that the subjects might have de- 
veloped a set to expect the out-of-sequence 
digits at these locations. To avoid this, the 
locations of the out-of-sequence digits were 
determined from a table of random numbers. 

Scoring and methods of analyses. Each 
series was scored as correct or incorrect. In 
the incorrect series the analysis was concerned 
only with the first error in each series. This 
error was classified as a “higher” error or 
a “lower” error. A higher error was defined 
as an error that was higher than the digit 
preceding it (e.g. changing 2, 7, 3, 9 to 2, 7, 
8, 9); a lower error was defined as an error 
that was lower than the digit preceding it. 

Certain responses were excluded by this 
definition of scorable errors. These included: 
failure to respond at all; giving an incorrect 
response on the first digit of the series; giving 
a correct but incomplete repetition (e.g. 
giving 3, 9, 4, 6 to the series 3, 9, 4, 6, 2, 3); 
and repeating a digit (e.g. giving 3, 9, 4, 6, 6 
to the series 3, 9, 4, 6, 8). 

From the first hypothesis, we would expect 
that (a) in predominantly ascending series 
there will be a significantly greater frequency 
of higher than lower errors, (4) in pre- 
dominantly descending series there will be 
a significantly greater frequency of lower than 
higher errors, and (c) in random series there 
would be no significant differences in the 
frequency of higher and lower errors. 

For each of the three kinds of series, the 
frequency of higher and the frequency of 


lower errors were compared by means of the 


chi-square test, 1 theoretical 50:50 
chance distribution. The justification for 
using a 50:50 chance hypothesis is apparent 
in the random series. In the other two types 
of series, however, it might be asked if the 
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structuring of the digits prevented the 50:50 
chance hypothesis from being used. For 
example, in predominantly ascending series 
there might have been a greater tendency 
for the error to be higher than the digit pre- 
ceding it, since, by definition, the first few 
digits in predominantly ascending series 
tended to be “low” digits. The definition 
of the predominantly ascending series im- 
plies, however, that the last few digits tended 
to be “high” digits. Therefore, the structur- 
ing effects of the first few digits were 
counterbalanced by the structuring effects of 
the last few digits. Thus the 50:50 chance 
hypothesis was considered applicable for all 
three types of series. 

To test the second hypothesis, involving 
the differences between the two groups, the 
frequency of the higher and the frequency 
of the lower errors of the patient group were 
compared by means of the chi-square test 
with the frequency of higher and lower 
errors of the attendant group. The two 
groups were compared on each of the three 
kinds of series. 

In using the digit span test, it has been 
frequently observed that two adjacent digits 
have been reversed in the repetition (e.g. 
changing 1, 4, 3, 7, 9 to 1, 3, 4. 7,9). A “re- 
versal error” was defined as an error in which 
the positions of two immediately adjacent 
digits have been reversed, and the rest of 
the digits in the series have been repeated 
correctly. A reversal error was scored as (a) 
an upward reversal (e.g. changing 1, 4, 3, 7; 9 
to 1, 3, 4, 7, 9) or (6) a downward reversal 
(e.g. changing 1, 4, 3, 7, 9 tO I, 4, 75 35 9)- 

Each series had a finite number of upward 
and downward reversals (e.g. the series 1, 4, 
3, 7, 9 contains one upward reversal and 
three possible downward reversals). For each 
of the three kinds of series, it was therefore 
possible to compare the observed frequency 
of upward and the frequency of downward 
reversals with the theoretical frequency of 
upward and downward reversals. This was 
done by means of chi-square to test the first 
hypothesis. 

The frequencies of upward and downward 
reversal errors committed by the patients 
were then compared with those committed 
by the attendants. This was done using the 
chi-square test in each of the three kinds of 
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series, in order to test the second hypothesis 
involving the comparison of the two groups. 


REsuLTs 

There was a total of g12 errors; 307 were 
on the predominantly ascending series, 304 
on the predominantly descending series, and 
301 on the random series. Of the g12 total 
errors, 207 were reversal errors. 

Table 2 contains the results of testing. 
Chi-square values significant at the 1 per 
cent level are marked with a double asterisk; 
those significant at the 5 per cent level are 
marked with a single asterisk. 


TABLE 2 
Cut-sQUARE VALUES IN THE COMPARISON OF PATIENTS 
AND ATTENDANTS, Errors, AND ReversAL Errors 


REVERSAL 
ERRors Errors 





Ascending series 
Patients 
Attendants 
Patients vs. Attendants 
Descending series 
Patients 
Attendants 
Patients vs. 
Random series 
Patients 
Attendants 
Patients vs. Attendants 


Attendants 





First hypothesis. In the predominantly as- 
cending series, the results indicated that, for 
both groups, there were a significantly greater 
number of “higher” than “lower” errors. Re- 
garding reversal errors, there was a significant 
tendency for the reversals to be in the upward 
direction. 

In the random series, for both groups there 
was no significant difference in the fre- 
quency of higher and lower errors. In terms 
of reversal errors, the attendant group demon- 
strated no significant tendency for the re- 
versals to be in either the upward or down- 
ward direction. However, the patient group 
did demonstrate a significant tendency for 
their reversal errors to be in an upward di- 
rection. With this one exception all the 
above findings support the first hypothesis. 

Second hypothesis. In terms of the number 
of higher and lower errors no significant 
differences were found between the two 
groups on either the predominantly ascend- 
ing, predominantly descending, or random 
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series. In terms of reversal errors, the two 
groups did not differ on either the pre- 
dominantly ascending or descending series. 
On the random series, however, the groups 
did differ, ic., the patient group demon- 
strated a significantly greater tendency for 
their reversal errors to be in an upward di- 
rection. With this one exception all of the 
results support the second hypothesis. 


CoNCLUSIONS 

The results appear to warrant the following 
general conclusions: (4) As a series of digits 
approached a one-through-nine sequence, the 
errors tended to structure the series to con- 
form to the one-through-nine sequence. (6) 
No significant differences were found be- 
tween a group of mental hospital patients 
and a group of hospital attendants in their 
tendency to structure the digits in accordance 
with the general directionality. That is, the 
results failed to indicate that “depth of the 
maladjustment” was significantly related to 
the types of errors investigated. 

With the following single exception, the 
results supported the above conclusions: on 
the random series, the patient group demon- 
strated a significant tendency for their re- 
versal errors to be in an upward direction; 
the patient group significantly differed from 
the attendant group in this respect. 


IMPLICATION 
We have investigated an important kind 
of error in the digit span test and found it to 
be unrelated to “depth of the maladjustment,” 
i.c., it occurred in both a designated normal 
and abnormal group. However, by utilizing 
the method of analysis suggested in this 
study, it seems possible to predict, to some 
degree, the position and kind of the error. 
It is likely that the errors (i.e. behaviors) 
in other types of clinical tests might similarly 
be interpreted in terms of the concepts of 
learning theories. 
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ATTITUDE AS A DETERMINANT OF DISTORTIONS IN RECALL 


MILTON ROKEACH 
Michigan State College 


T is almost half a century since Freud first 

advanced the notion in his Psychopath- 

ology of Everyday Life (8) that slips of 
the tongue and pen are frequently meaning- 
ful acts, serving a function for the individual. 
It is curious that with all the ingenuity ex- 
perimental psychologists have demonstrated 
in attacking difficult problems little or no 
empirical research has been done to deter- 
mine the validity of Freud's interpretation 
of this everyday phenomenon. 

One method for producing and studying 
slips suggested itself as a result of the follow- 
ing incident which the writer had occasion 
to observe: Mr. A, a person known to be 
relatively prejudiced, was saying to Mr. B 
that Negro students generally don’t do as well 
academically as white students. To illustrate 
his point, he cited the case of a particular 
Negro student, mistakenly referring to him 
as Mr. Brown. Mr. B, who also knew the 


student, raised the question as to whether 


Brown was the correct name. Mr. A, after 
some hesitation, acknowledged that it wasn’t 
and, upon further reflection, recalled the cor- 
rect name—jJohnson, 

This incident served as a sort of model 
which we hoped to reproduce and elaborate 
on experimentally. It seemed to us that an 
experimental approach to the problem of 
slips might provide us with a fresh tech- 
nique for the study of repression. As Sears 
has pointed out: “There is little to be con- 
cluded from the experimental study of repres- 
sion. The non-analytic data offer no 
refinement of the theory, no addition of rele- 
vant new variables, no streamlined tech- 
niques that promise eventual solution of the 
problems posed by Freud. . . . The triviality 
of obtained differences in this field makes 
a most discouraging picture” (11, p. 120). 

One possible reason for this “discouraging 
picture” is that in these studies the main 

1 We may define a slip (of the tongue or pen) as 
any error in speech or writing regardless of cause. It 


remains to be seen whether the slips to be produced in 
this study are a function of dynamic factors. 


concern has been simply to compare the dif- 
ferential recall of threatening and nonthreat- 
ening material. Thus, the most that may be 
accomplished by such studies is to provide a 
yes-or-no answer regarding the validity of the 
Freudian hypothesis that ego-threatening ma- 
terial is forgotten. Beyond this, such studies 
have not been able to go. 

Frequently, however, the repressed idea 
leaves an overt trace in the form of a false 
recollection which takes a certain direction. 
Such errors, Freud contends, “are compro- 
mise-formations; they express part-success 
and part-failure for each of the two inten- 
tions; the threatened intention is neither en- 
tirely suppressed nor . . . does it force itself 
through intact” (9, p. 60). Thus, the experi- 
mental study of such false recollections 
should enable us to go beyond the mere de- 
termination that repression is a process which 
does or does not take place, to a more re- 
warding study of the direction of the result- 
ing cognitive reorganization. 

There is possibly a second reason for this 
“discouraging picture.” Repression and 
other dynamic mechanisms have generally 
been conceived as functions of what Allport 
has called the individual rather than the com- 
mon aspects of attitudes (2). As long as re- 
pression is conceived in this manner one is 
confronted with the difficult task of having 
to set up test situations dealing with unique 
individuals having unique attitudes toward 
unique persons. If, however, repression is 
conceived also as being a function of the com- 
mon aspects of attitudes one can think of set- 
ting up situations in which groups of indi- 
viduals have common attitudes toward groups 
of people. 

With these considerations in mind we at- 
tached various emotionally-toned and neu- 
tral names to a relatively large number of 
Negro and white photographs. These were 
to be presented to subjects relatively high and 
low in prejudice for incidental recall. To 
determine individual as well as group atti- 
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tudes each subject would be asked to evaluate 
each photograph for friendliness-unfriendli- 
ness. It was anticipated that in such an 
unstructured situation many slips would oc- 
cur and that individual and group attitudes 
would be the most important determinant 
of such slips. More specifically, we hypothe- 
sized that: (a) Relatively high-prejudiced 
persons should show more distortion than 
relatively low-prejudiced persons in recalling 
names attached to Negro photographs. (4) 
Furthermore, the direction taken by at least 
some of these distortions should be indica- 
tive of compromise formations. (¢) There 


police files* and all were of standard front pose 
with head and neck showing. Set II was composed 
of 4 slides, each slide showing 12 persons, half 
white, half Negro, half male, half female. These 
faces were arranged randomly in 4 rows and 3 
columns. The 48 persons shown in the group 
slides in Set II were the same persons shown in 
the 48 individual slides in Set I. However, a name 
was printed under each of the faces shown in the 
group slides in Set II. These names and the man- 
ner in which they were distributed are shown in 
Table 1. 

Table 1 shows that each person had a given name 
and surname. The given names were assigned to 
lend an air of reality but were otherwise considered 
unimportant for present purposes. Twelve surnames 
were used: White, Gray, Greene, Brown, Black, 


TABLE | 


NaMEs OF THE 48 Persons SHown 1n Set II 


Mace WuirTEs Femace WHITES 





Mare NEGROES FeMALe NEGROES 





. Charles White 
*2. William Gray 
. Fred Greene 
Frank Brown 
. James Black 
» Robert Lynch 
. Peter Cole 
. John Best 
. Louis Good 
. Ralph Low 
. Walter Smith 
12. Thomas Jones 


. Jean White 

2. Edith Gray 

. Ann Greene 

. Helen Brown 

. Barbara Black 

. Mary Lynch 

. Ruth Cole 

° Joan Best 

. Shirley Good 
10. Betty Low 
*11. Dorothy Smith 
12. Mildred Jones 


. Ann White 

. Mary Gray 

. Betty Greene 
4. Shirley Brown 
5. Edith Black 
*6. Dorothy Lynch 
7. Barbara Cole 
*8. Helen Best 
9. Jean Good 
10. Mildred Low 

*11. Joan Smith 
12. Ruth Jones 


*1. Robert White 
Fred Gray 
John Greene 
Peter Brown 
Ralph Black 
Charles Lynch 
Thomas Cole 
Frank Best 
Walter Good 
James Low 
William Smith 
Louis Jones 


*3, 
3. 
4: 
5. 

*6. 

bw D 

°8. 
9. 

10. 

“11. 


12. 


I 
3 





* Incidental recall of names was determined for these 


should be no differences between the low- 
and high-prejudiced groups with respect to 
names attached to white photographs. (d) 
Names attached to photographs perceived as 
unfriendly should be distorted more often 
in recall than names attached to photographs 
perceived as friendly. 


Tue ExpertMent ” 


Materials 


The materials consisted of two sets of 3X4 in. 
photographic slides. Set I was composed of 48 in- 
dividual photographs of which 12 were male whites, 
12 female whites, 12 male Negroes, and 12 female 
Negroes. The pictures in each subgroup were 
matched roughly for age, the range being from 
about 20 to 50 years, as estimated from the photo- 
graphs. These photographs were obtained from 


2 The writer wishes to express his appreciation to 
Mr. Jack Maatsch who conducted the experiment and 
to Mr. Wendell Roberts who helped in the analysis of 
the data. 


24 persons only. See explanation in text. 


Lynch, Cole, Best, Good, Low, Smith and Jones. 
Each of these 12 names was assigned to four photo- 
graphs differing in race and sex. For example, there 
were four photographs bearing the name White: a 
male white, a female white, a male Negro and a 
female Negro. The same design holds for the other 
names. 


Subjects 


The 10-item Ethnocentrism Scale (1, 10) was ad- 
ministered to about 90 white students in introduc- 
tory psychology at Michigan State College. From 
this group, twenty Ss were obtained for individual 
testing, ten from the upper and ten from the lower 
quarter of the distribution. 


Procedure 


Part I. The Ss were brought individually into a 
dark room. Twenty-four of the 48 individual slides 
of Set I, without names, were projected onto a 


8 The writer is greatly indebted to Lt. Joseph F. Smith 
of the Kansas City Police Department and to Dr. Ralph 
Turner of the Department of Police Administration at 
Michigan State College for making these photographs 
available to the writer. 
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screen 15 feet away in a predetermined random 
order. The 24 photographs shown were those 
marked with an asterisk in Table 1. These photo- 
graphs were composed of 6 male whites, 6 female 
whites, 6 male Negroes and 6 female Negroes. Only 
those having the following 6 names were repre- 
sented: White, Gray, Lynch, Cole, Best and Smith! 


The instructions were as follows: 


I am going to present some faces to you one at a 
time Will you look the faces over and then try to 
estimate the age in years, the height in feet and inches, 
and the friendliness as briefly and specifically as possible. 
Write these estimations in the appropriate columns as 
quickly as | Are there any questions before we 


begin? 


ssible. 


The qualitative comments on perceived friend 
liness were subsequently categorized as “Friendly” 
or “Unfriendly.” 

Part Il. The S was then shown the four group 
slides of Set II, with names attached. These in 
cluded the 24 faces shown in Part I and 24 addi- 
tional faces. The instructions: 


Now I am going to show you 4 different slides. Each 
of the slides has 12 faces on it. You will notice that 
each picture has a name printed beneath it. Some cf 
these pictures are the ones you saw before and some are 
not. Please write down the names of those you recog 
nize in the order you recognize them. The first picture 
you recognize will be the first name on your sheet and 
the second one you recognize will be your second 


response. Are there any questions before we begin? 


Part Ill. The S was then shown the same 24 
faces presented in Part I (without names) and in 
the same order with the following instructions: 


Now I will show you the individual pictures again. 
Please write down his or her name. If you can’t remem- 
ber it, take a guess. Your first guess is the most 
accurate. Once again, if you are not sure of the person's 


name, guess. Are there any questions before we begin? 


REsULTs 

preliminary considerations. The 
high- and low-prejudiced groups seem to 
be equated in initial memorial ability. The 
Highs recall 17 per cent of the Negro and 
14 per cent of the white faces correctly. The 
Lows recall correctly 12 and 14 per cent, re- 
spectively. 

The names correctly recalled cannot be at- 
tributed to any specific learning of names- 
with-faces. Since the 24 faces shown in Parts 
I and III had only six different names 
(White, Gray, Lynch, Best, Cole, Smith), 
the correct name should be recalled consid- 
erably more often than once in six (17 per 
cent) to be attributed to the operation of a 


Some 


nted in 
limitations and to pre- 


4Only 24 of the 48 photographs are pres 
Parts I and III because of time 


vent the onset of fatigue factors 


Mitton RoKkEAcH 


specific learning factor. The obtained fre- 
quencies of correct recall are very close to this 
theoretical frequency. Hence, we may as- 
sume that all the names recalled are incor- 
rect. That this assumption is fairly valid is 
supported by the fact that the Ss complained 
in Part III that the task was extremely diffi- 
cult, that they hadn’t been told this was a 
memory test, that they made no effort to 
learn the names, etc. 

To consider all the responses as incor- 
rect, however, does not mean that they are 
the result of purely chance factors. The ex- 
perimentally introduced names were recalled 
by the 20 Ss to 85 per cent of all the faces 
shown (408 out of 480) and it remains to be 
seen whether selective factors were operat- 
ing.® 

Table 2 shows the frequency of recall of 
each of the names by the Low and High 
groups to the Negro and white photographs. 
Also shown are the x* values and the sig- 
nificance levels obtained for each name by 
comparing the Low and High groups. Table 
2 shows the x? values obtained for all the 
data shown in Table 2, for the prejudice and 
race variables, and for the interaction between 
these two variables. 

As seen in Table 3 


the total x* is 60.17 
which, for 27 degrees of freedom, is sig- 


nificant beyond the 1 per cent level. The 
major portion of this value is contributed 
by the prejudice variable. The x* is 38.49 
which, for 9 degrees of freedom, is also sig- 
nificant beyond the 1 per cent level. 

These are in contrast to the results found 
for the race variable. An inspection of Table 
2 shows that for both the Low and High 
groups the frequency of recall of the various 
names to the white photographs is strikingly 
similar to those found for the Negro photo- 
graphs. It is seen in Table 3 that there is 
no significant difference in the frequency of 
recall of the various names to the white and 
Negro photographs (x? = 9.28). Moreover, 
the interaction between prejudice and race is 
also nonsignificant (x* = 12.40). Thus the 
expectation that the Lows and Highs would 
differ with respect to the recall of Negro 

5In this respect our methodology departs from that 
of most selective recall studies. In these studies the 
analvsis is in terms of the differential correctness of 


recall. In the present study, the analysis is in terms of 
the differential incorrectness of recall. 
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TABLE 2 


FreQueNcY 


oF Recatt or Names sy Low- ano Hicu-Preyupicep Groups to Necro AND 


Wuite PHoToGRAPHS 











Hicuhs 








Whitt 
PHotos 


NEGRO 
PHoTos | 


*White 
*Gray 
Greene 
Brown 
Black 
*Lynch 
*Best 
*Cole 
*Smith 

Jone s 

Low 

Good 

Other Names 
No response 


Totals 


Wuire | 


PHoTos Toras 


13 
10 


240 





* Names preceded by asterisk are for the 24 photo 
** The chi-square value is indeterminate because of 


names (Hypothesis 1) but not with respect 
to the white names (Hypothesis 3) is not 
completely borne out. The two groups differ 


significantly from each other with respect to 
the recall of both Negro and white names. 

To determine more specifically what the 
nature of these differences might be it is 
necessary to examine the significant Low- 
High differences found for the various names. 
It is seen in Table 2 that the high-prejudiced 
group recalls the names Gray, Greene, and 
Lynch to both Negro and white photographs 


TABLE 3 


Cut-Souvart ANALYsIs OF Over-ALL Resutts SHOWN 
In Tasre 2 


Total 
A. Low vs. High 38. 
B. Negro vs. White 9. 
Interaction AXB 12. 





* To eliminate small theoretical frequencies the data 
for Greene, Brown and Black were lumped together and 
the data for Jones, Low and Good were lumped together. 
We thus have a 4X10 table with 27 degrees of freedom. 
To climinate the possibility of bias, we also combined 
the data for all six of these names. The significance 
levels found are the same as those shown above. 


graphs shown in Parts I and III of the experiment. 


low theoretical frequencies. 


15, 14, and 38 times, respectively. The low- 
prejudiced group recalls Gray, Greene and 
Lynch only 6, 0, and 19 times, respectively. 
The x? values are 4.04, 14.42, and 7.19, re 
spectively. For one degree of freedom, the 
latter two values are significant beyond the 
1 per cent level and the former value beyond 
the 5 per cent level. 

The Lows, on the other hand, recall the 
name Jones 13 times as compared with 4 
times for the Highs. The x? value is 4.93, 
significant beyond the 5 per cent level. More- 
over, the Lows fail to respond (“No response” 
in Table 2) 33 times as compared with the 
Highs’ 10 times. The x? value is 13.51, sig- 
nificant beyond the 1 per cent level. 

These specific findings further indicate 
that the Highs distort more than the Lows 
to both sets of photographs. The greater re- 
call of Gray, Greene, and Lynch by the Highs 
is qualitatively different from the greater re- 
call of Jones and the greater number of “No 
responses” given by the Lows. We will dis- 
cuss the possible theoretical significance of 
these results later on. 

® To increase the theoretical frequencies in each cell 
we first combined for each name the data obtained for 
the Negro and white photographs. We felt this to be a 


justifiable procedure since the Negro-white variable was 
not significant. 
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We turn, finally, to a consideration of the 
results found for the photographs perceived 
as friendly and unfriendly. The various names 
were recalled about equally often to photo- 
graphs perceived as friendly and unfriendly. 
The over-all results are not significant, and 
none of the names is recalled significantly 
more often to friendly or to unfriendly pho- 
tographs.’ The interaction between preju- 
dice and perceived friendliness-unfriendliness, 
as determined by x* analysis, is also nonsig- 
nificant. 

It is only with respect to the “No response” 
category that the friendliness-unfriendliness 
variable is significant. As is shown in Table 
4, of the 43 “No responses,” 32 are given 
to photographs perceived as friendly and 
only 11 to photographs perceived as un- 


‘TABLE 4 


Frequency or “No Responses” Given To Puorto- 
GRAPHS PEeRceIveD AS FRIENDLY AND UNFRIENDLY 


No Art OTHER 

RESPONSE RESPONSES 

Friendly 32 211 
Unfriendly I 226 237 


Totals 43 437 480 





friendly. The x? value is 11.27 which, for one 
degree of freedom, is significant beyond the 
1 per cent level. 

A word about the recall of “other names.” 
In 29 instances names other than the experi- 
mentally-introduced names are recalled 
such as Johnson, Roberts, Rinehart, Stuart, 
Cook, and Doe. As is shown in Table 2, the 
Lows and Highs recall these names about 
equally often. It is of interest, however, that 
one Low S recalls the name Honey to a fe- 
male Negro perceived as friendly and one 
High S recalls the name Savage twice to two 
female Negroes perceived as unfriendly. 

Some incidental results. At the end of the 
experiment we had the Ss estimate the num- 


7 To conserve space we are omitting a presentation of 
the data on friendliness-unfriendliness. We are also 
omitting two other tables containing data for each indi- 
vidual subject. To obtain these data, order Document 
2505 from the American Documentation Institute, 1719 
N St.. N.W., Washington 6, D. C., remitting $1.00 for 
microfilm (images 1 inch high on standard 35 mm. 
motion picture film) or $1.00 for photocopies (6x8 
inches) readable without optical aid. 


Mitton RoKkEAcH 


be of Negro and white photographs shown 
in Parts I and III. For the Lows, the mean 
estimate of Negro and white faces shown are 
about equal and in accord with reality, the 
means being 12.7 and 12.5 respectively. For 
the Highs, the mean estimates are 15.0 and 
8.5, respectively. These results are consistent 
with other findings that prejudice accentuates 
the size of threatening out-groups (3). 

Another finding has to do with the fre- 
quency of correct recognition in Part II of 
Negro and white faces seen previously in 
Part I. Seeleman (12) found, with a technique 
similar to the present one, that high-preju- 
diced Ss recognize correctly more white than 
Negro faces and, conversely, that low-preju- 
diced Ss recognize correctly more Negro than 
white faces. Our data do not support this. 
In the present study, the Highs as compared 
with the Lows recognize correctly both more 
Negro and white faces. At the same time, 
they also recognize incorrectly both more 
Negro and white faces. 


Discussion 


For the sake of brevity, we shall confine 
ourselves here only to a consideration of the 
results found to be significant. These con- 
firm the broad hypothesis that attitudinal 
factors are operative in the production of 
slips. The high-prejudiced subjects clearly 
distort more than the low-prejudiced sub- 
jects not only in their recall of Negro names 
but also in their recall of white names. 

This suggests that prejudice against Ne- 
groes may be conceived not only as a mani- 
festation of ethnocentrism (1, 10) but per- 
haps, even more broadly, as a special case 
of a more pervasive misanthropy.® 

That this misanthropy should manifest 
itself in the significantly greater recall of 
Gray and Greene rather than White or Black 
is in accord with Freud’s contention that 
many errors in speech and writing represent 
compromise-formations. We cannot help but 
note, furthermore, that these compromise- 


8 This is corroborated in a recent study by Sullivan 
and Adelson (13). They administered two forms of 
the E scale. One form was as published (1, 10). The 
second form was identical with the first with the excep- 
tion that more general terms were substituted fer the 
names of the more specific outgroups (¢.g., “some 
people” for “some Jews”). They found a significant 
relationship between the scores obtained on the two 
scales. 
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formations are not so very different from the 
compromise-formations of such perception- 
ists as Brunswik (5) and Bruner and Post- 
man (4). 

Bruner and Postman, for example, define 
perceptual compromise as a reaction in which 
“a ‘perceptual middle ground’ exists between 
the expectancy and the stimulus contradicting 
the expectancy” (4, p. 215). Ome way in 
which their subjects reacted to tachistoscop- 
ically-presented incongruous trick cards (e.g., 
a red six of spades) was by compromise. For 
example, “the red six of spades is reported 
as either a purple six of hearts or the purple 
six of spades” (4, p. 214). 

In a sense, the slips committed by our 
Highs are similar to the compromise percep- 
tual reactions to the “trick” card. To be 
confronted with names many of which are 
not in accord with expectancy (e.g., Negroes 
named White and whites named Black) is, 
like the red six of spades, something of an 
incongruity. Being “intolerant of incongru- 
ity,” the cognitive reorganization is often 
in the direction of compromise. 

But this formulation in terms of compro- 
mise seems incomplete to us. It does not 
explain the significantly greater frequency of 
recall of Lynch by the Highs or the greater 
frequency of recall of Jones by the Lows. 
These could hardly be considered compro- 
mise-formations. Moreover, we may be over- 
looking something important by lumping 
Gray and Greene together and considering 
them both as compromise-formations. It 
seems to us that we are dealing with a con- 
tinuous rather than a discrete process. Our 
findings become more comprehensible if we 
conceive of the directions taken by the slips 
as lying along a continuum involving differ- 
ent degrees of compromise which terminate 
at one extreme in assimilation and at the 
other extreme in contrast. 

Viewed in this light the greater frequency 
of recall of Lynch by the high-prejudiced 
group may be indicative of cognitive reor- 
ganization in the direction of assimilation. 
The direction of the distortion seems to be 
such that there is a close congruence between 
the name recalled and the underlying atti- 
tude. 

The case is somewhat different for the 
Lows. The only name they recall significantly 
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more often than the Highs is Jones. This 
too may be regarded as a distortion but the 
distortion seems opposite in direction to as- 
similation. The name recalled is manifestly 
more neutral in affect as if to avoid recalling 
the more negatively-loaded names. It is ex- 
tremely tempting here to juxtapose the Gestalt 
concept of contrast with the psychoanalytic 
concept of reaction formation. But it is not 
possible to tell from our data whether this 
distortion represents a reaction formation de- 
signed to deny some underlying hostility or 
whether it merely represents an attempt to 
avoid other names within the experimental 
situation which are perceived to be negative 
in affect. This is a problem which merits 
further research. 

The Highs, then, are found to distort gen- 
erally more often than the Lows. This is in 
contrast to the greater “tolerance of ambigu- 
ity” in the Lows (7). Relevant here is the 
finding that it is the Low group which fails 
significantly more frequently than the High 
group to make any guesses at all. These 
results suggest that the Lows can more often 
afford to leave the social stimulus as they 
found it—ambiguous rather than structured 
(7), incongruous rather than congruous (4), 
open rather than closed (6). The Highs, in 
contrast, more frequently feel compelled to 
guess at the names, indicating a greater need 
for closure thereby resolving the ambiguity 
and incongruity. 

Frenkel-Brunswik assumes that the basis 
for this “intolerance of ambiguity” in the cog- 
nitive sphere is to be found in the emotional- 
motivational sphere. While we did not find 
any significant differences in the frequency 
of recall of names to faces perceived as 
friendly and unfriendly, it is interesting that 
we did find a significant difference in support 
of this assumption. Significantly more “No 
responses” are given to photographs perceived 
as friendly. This lends strong support to 
Frenkel-Brunswik’s hypothesis that a threat- 
ening social environment leads to “intoler- 
ance of ambiguity” and, conversely, that a 
permissive social environment leads to “toler- 
ance of ambiguity.” 

A final comment: The results obtained in 
this study suggest that the method we have 
employed to study slips is a promising one. 
It is anticipated that with suitable modifica- 
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tions and refinements other problems involv- 
ing repression and cognitive reorganization 
can be successfully attacked. 


SUMMARY 


The present study demonstrates an experi- 
mental method for studying repression by 
producing errors in the recall of names anal- 
ogous to Freudian slips of the tongue. Ten 
low- and ten high-prejudiced Ss were shown 
photographs representing equal numbers of 
male and female whites and Negroes. The 
following names were equally distributed 
by race and sex to the faces shown: White, 
Gray, Greene, Brown, Black, Lynch, Best, 
Cole, Smith, Jones, Low, and Good. The Ss 
judged each photograph for friendliness or 
unfriendliness. Then followed a test for in- 
cidental recall. 

The high-prejudiced group recalled the 
names Gray, Greene, and Lynch significantly 
more often than the low-prejudiced group 
to both Negro and white faces. The low- 
prejudiced group recalled the name Jones 
and gave “No responses” significantly more 
often than the high-prejudiced group to both 
Negro and white faces. The frequency of 
recall of the various names to the Negro and 
white faces was not significantly different 
from each other indicating that prejudice 
against Negroes may be a manifestation of 
a more pervasive misanthropy. 

No significant differences were found in 
the frequency of recall of the various names 
to photographs perceived as friendly and un- 
friendly. However, significantly more “no 


responses” were given to faces perceived as 
friendly than to faces perceived as unfriendly. 
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The results are interpreted in the light of 
Freud’s theory of slips and in terms of a con- 
tinuum of cognitive reorganization ranging 
from assimilation to various degrees of com- 
promise to contrast. Furthermore, the differ- 
ences between the low- and high-prejudiced 
groups are discussed in terms of differ- 
ences in “tolerance of ambiguity,” “tolerance 


ss a a on 
of incongruity,” and “closure. 
REFERENCES 

1. Aporno, T. W., Frenxer-Brunswik, E., Levinson, 
D. J., & Sanrorp, R. N. The authoritarian per- 
sonality. New York: Harper, 1950. 

. Atiport, G. W. Personality: a psychological inter- 
pretation. New York: Holt, 1937. 

. Attport, G. W. Prejudice: A problem in psycho- 
logical and social causation. ]. soc. Issues, 1950, 
Supplement Series No. 4, 1-26. 

. Bruner, J., & Postman, L. On the perception of 
incongruity: A paradigm. j. Pers., 1949, 18, 
206-223 

Brunswik, E. Discussion: Remarks on functional- 
ism in perception. ]. Pers., 1949, 18, 56-65. 

. Fisher, J. The memory process and certain psycho- 
social attitudes, with special reference to the Law 
of Pragnanz. ]. Pers., 1951, 19, 406-420. 

. Frenxer-Brunswik, E. Intolerance of ambiguity 
is an emotional and _ perceptual personality 
variable. /. Pers., 1949, 18, 108-143. 

. Freup, S. The psychopathology of everyday life. 
In The basic writings of Sigmund Freud. (Trans. 
& ed. by A. A. Brill.) New York: Modern 
Library, 1938. 

. Freup, S. A general introduction to psychoanalysis. 
New York: Garden City, 1943. 

o. Levinson, D. J. An approach to the theory and 
measurement of ethnocentric ide ]. Psychol., 
1949, 28, 19-39. 

. Sears, R. R. Survey of objective studies of psycho- 
analytic concepts. Soc. Sci. Res. Coun. Bull., 
1943, No. 51. 

. Seeteman, V. The influence of attitude upon the 
remembering of pictorial material. Arch. Psychol., 
1940, No. 258. 

. Sutzivan, P. L., & Apetson, J. Ethnocentrism and 
misanthropy. Unpublished study. 


logy. 


Received by this Journar, August 16, 1951. 





A COMPARISON OF REMITTING AND NONREMITTING 
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HaT are the differences between 

\X schizophrenic patients who make 

social recoveries and those who 
remain hospitalized? What roles do per- 
sonality and intellectual factors play in 
the improvement and recovery of schizo- 
phrenics? The present study seeks an 
answer to these questions in a comparison 
of two groups of schizophrenics on the 
Wechsler-Bellevue and Rorschach tests and 
on a clinical rating scale. One group con- 
sists of schizophrenic patients who have im- 
proved sufficiently to leave the hospital (the 
remitted group); the other consists of schizo- 
phrenic patients who have remained hos- 
pitalized (the hospitalized group). 

Previous studies have shown that patients 
who recover or improve tend to show better 
contact with reality, more realistic aspirations, 
and more adequate affective reactions than 
unimproved patients (5, 7, 8, 9). Patients 
who improve are more outgoing (1,5). Un- 
improved patients tend to be characterized 
by seclusiveness, irritability, asocial behavior, 
frigidity of mood, irresponsibility, and nar- 
rowing of affective reaction (1). Apparently 
the less the schizophrenic has departed from 
the social norm and the more integrated he 
is emotionally and intellectually, the more 
the chance that he will show a favorable 
response to treatment. 

Few studies of outcome in schizophrenia 
excepting those utilizing intelligence tests 
have attempted to validate the findings of 
one study on a different sample. Little 
attention has been paid to testing the theory 
underlying the psychological instruments 


1 This article is a revision of a dissertation submitted 
in partial fulfillment of the requirements for the degree 
of Doctor of Philosophy at the University of Michigan. 
Thanks are due to Dr. A. Carp, chairman of my 
doctoral committee, and also to Professors Satter, Hutt, 
Kelly, and Bordin for their help on this problem. The 
paper is published with permission of the Chief Medical 
Director, Department of Medicine and Surgery, Veterans 
Administration, who assumes no responsibility for the 
opinions expressed or conclusions drawn by the author. 


themselves by making predictions from the 
theory concerning the performance of clini- 
cal groups on the tests. 

In this study a number of predictions con- 
cerning the relative test performance of the 
two groups of schizophrenics were derived 
from the theoretical assumptions underlying 
the variables measured by the Rorschach and 
the Wechsler-Bellevue tests. The predictions 
were taken directly from Wechslex’s and 
Rorschach’s thinking (10, 11). 

Assuming that the personality differences 
reported by previous studies really exist, the 
following predictions were made: 

1. On the Wechsler-Bellevue test remitted 
schizophrenics will show higher total LQ. 
scores than hospitalized schizophrenics and 
also higher Verbal and Performance scores. 
These predictions assume that the pertinent 
variables regarding background factors will 
be homogeneous. In view of contradictory 
evidence concerning subtest score patterning 
and differences in impairment of functioning 
on performance and verbal tests no predic- 
tions were made concerning these variables. 

2. On the Rorschach test remitted patients 
will show more human movement (M), 
form-color (FC), shading (TSH), popular 
(P), and human content (H, Ad) responses, 
higher F+-%, higher extended F+-°/, lower 
W:M ratio, and more ambiequal and more 
introversive experience-balance. Hospitalized 
patients will show more color-form (CF), 
pure color (C), color-naming, confabulatory, 
contaminated, and position responses, and 
more extratensive and more coartated experi- 
ence-balance. No predictions were made 
with regard to whole (W), common detail 
(D), rare detail (Dd), white space (S) re- 
sponses, or for animal (A°%) or other types 
of content responses, since the interpretations 
which Rorschach attached to these variables 
and subsequent findings by other investi- 
gators did not permit prediction in this study, 
at least with respect to differences between 
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the groups. Original responses were not 
included because of the extreme difficulty in 
scoring them. 

The two groups were also compared with 
a scale developed by the late Dr. Louis 
Granich for coding clinical and _ historical 
items from the case records of patients. Only 
items which could be rated with fair objec- 
tivity were included. No predictions were 
made concerning differences between groups 
for these variables. 


METHOD 


Two samples of subjects served in this 
study. They were drawn from three 
Veterans Administration hospitals. Each 
consisted of patients belonging to either 
hospitalized or remitted groups. In order 
to qualify as subjects for either group patients 
had to be veterans of World War II, white, 
male, and 40 years or less at the time of test- 
ing. They had to have been admitted to the 
hospital sometime between August 1946 and 
August 1948. The testing had to have been 
completed within go days of admission. The 
psychiatric and medical examination had to 
show negative neurological findings. In each 
case a final diagnosis of schizophrenia was 
made on discharge from the hospital and/or 
at the end of the three- or four-week diag- 
nostic period. The patient had to have been 
tested before receiving any form of shock 
treatment during his present hospitalization. 

The two groups were differentiated by 
means of the following criteria: In the hos- 
pitalized group were included schizophrenic 
patients who had been admitted to the hos- 
pital some time between August 1946 and 
August 1948 and who were still patients at 
the same hospital in August 1950. During 
that time they had never been away from 
the hospital for any one period exceeding go 
days. In the remitted group were included 
patients who had been admitted to the hos- 
pital during the same period but who were 
subsequently discharged and have remained 
in remission for a minimum, uninterrupted 
period of at least six months since that time. 
Actually almost all the patients in the re- 
mitted group appear to have remained in 
remission much longer than the minimum 
period. 
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The two groups of schizophrenics were 
obtained as follows: From the entire pool of 
schizophrenic patients subjects were selected 
in accordance with the criteria outlined 
above. Only those patients were selected for 
whom Rorschach or Wechsler records were 
available. The groups were kept equal in 
size, additional cases being eliminated at ran- 
dom. In Sample I drawn from Hospital A 
there were 92 patients for whom Rorschach 
records were available; in Sample II drawn 
from Hospitals B and C there were 56. In 
Sample I there were 88 patients wth Wechs- 
ler records; in Sample II, 56. The scoring of 
all records was checked by the writer. He 
also rescored all Rorschach records. The 
reliability of his rescoring of the records was 
tested by comparing his scoring of selected 
records with that of two other scorers. In 
all instances agreement among scorers was 
over 85 per cent on total Rorschach responses 
(disagreement on any part of the scoring of 
a response being considered a miss for the 
entire response) and over 95 per cent on 
individual scoring items. 

To test the possibility that reports influ- 
enced the disposition of patients, six VA 
trainees judged 20 test reports for probable 
remission or nonremission of patients. The 
group made a mean of 53 per cent accurate 
judgments, barely above chance. The results 
suggest that the reports by themselves did not 
influence greatly the disposition of patients. 

The subjects were permitted to vary ran- 
domly on such variables as age, education, 
occupation, religion, marital status, diagnostic 
subtype, and number of previous hospitaliza- 
tions. No significant differences were found 
in either sample between the remitted and 
the hospitalized patients for any of these 
variables. 

The mean length of hospitalization for re- 
mitted schizophrenics was six months (range 
from one to 22 months) in Sample I, three 
months (range from one to nine months) in 
Sample II. The mean length of hospitaliza- 
tion for nonremitted schizophrenics was 35 
months (range from 24 to 48 months) in 
Sample I, 32 months (range from 24 to 46 
months) in Sample II. There was no over- 
lap between the two groups in either sample. 
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TABLE 1 


CoMPARISON OF REMITTED AND HospiTaLizeD SCHIZOPHRENICS ON WECHSLER Scores IN SAMPLES I AND II 








VARIABLE Mr MA 


Sic A p 


Sic r 





Total IQ 
Sample I 
Sample II 


Verbal IQ 
I 
ll 


Perf. 1Q 
I 
II 


Comp. 
I 


li 
Dig. Sp. 
I 


Il 
Arith. 
I 


II 


Simil. 


Pict. Comp. 
I 
II 


Bl. Des. 
I 
II 


Obj. Assy. 
I 
II 
Dig. Sym. 
9.1 
8 


II 3.6 





88 for Sample I, 56 for Sample II. 


REsuULTs 
Wechsler-Bellevue Findings 


From Table 1 we see that the three pre- 
dictions concerning differences in I.Q. scores 
were borne out in both samples. The remit- 
ted group scored higher on total, verbal, and 
performance I.Q. scores. The differences are 
significant at the .05 level for total LQ. in 
both samples, for verbal LQ. in Sample II, 


and for performance I.Q. in Sample I. The 
remitted group also attained higher scores in 
both samples on 10 out of i1 subtests. The 
hospitalized group scored higher on only one 
subtest, Similarities, and then only in one 
sample. The performance of both groups 
went up or down on much the same subtests. 
The relative ranks of the subtests within each 
group remained fairly constant for all groups. 
The magnitude of the differences between 
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groups on subtests varied from one sample 
to the other. 

The finding of higher 1.Q. scores for the 
remitted group tends to support the assump- 
tion, derived from Wechsler’s definition of 
intelligence (11), that remitted patients have 
a greater present capacity for dealing effec- 
tively with the environment. At the same 
time it must be noted that the absolute dif- 
ferences in 1.Q., while statistically significant, 
are not large. Moreover, the mean 1.Q.’s of 
all groups are in the so-called “normal” range 
of intelligence. The comparatively small dif- 
ferences in 1.Q. scores do not permit a defini- 
tive statement concerning the relation of 1.Q. 
to remission. The correlation between I.Q. 
score and remisson is small for both samples 
(r ot. vie. Of .23 for Sample I, 24 for Sample 
II). 

Rorschach Findings 

A total of 18 predictions were made con- 
cerning differences between the two groups 
on Rorschach variables. For the analysis of 
the data, statistical procedures were set up 
prior to the collection of the data in accord- 
ance with Cronbach’s recommendations (2). 
The chi square or the critical ratio was used 
where either was more appropriate or its 
use recommended. The chi-square categori- 
zations used here were determined in ad- 
vance, primarily by reference to the tables 
of frequency set up by Rorschach (10) for 
the various test variables. 

The chi-square breakdowns occasionally 
become complicated as in the case of the ex- 
perience-balance and W:M. From Table 
XIV of Rorschach’s book (10, p. 86) the fol- 
lowing arrangement was worked out: 

An experience-balance was considered 


1. Coartated if both M and Sum C were equal 
to one or less. 

2. Ambiequal if either M or Sum C were greater 
than one and the difference between the two were 
one or less. 

3. Introversive if M were greater than Sum C 
plus one. 

4. Extratensive if Sum C were greater than M 
plus one. 


The specific predictions concerning the re- 
lationships of the groups on the four break- 
downs were tested by 2X2 tables. For in- 
stance, in comparing the two groups on coar- 
tation, the categorization was in terms of 
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presence or absence of a coartated experience- 
balance. 

For W:M the following categorization was 
employed in accordance with Klopfer’s rec- 
ommendations (6) : 


1. W smaller than, equal to, or greater than M 
up to 2xM. 

2. W greater than 2XM and W and M both 
equal to or less than two. 


For the most part Beck’s system for scor- 
ing location, determinants, and popular re- 
sponses was employed. Shading was treated 
as one category following Rorschach’s treat- 
ment of it, the breakdown being in terms 
of the relation of shading to form (form- 
shading, FSH, shading-form, SHF, and pure 
shading, SH). 

The scoring of F+- was by means of Beck 
with reference to Hertz’s manual (4) for re- 
sponses not found in his tables. In addition, 
all responses which had a form factor in them 
were scored for form-accuracy (F, M, FSH, 
SHF, FC, CF). The proportion obtained 
by dividing the number of “good” form 
responses by the total number of form 
responses was then called the “extended 
(F+%).” 

Since the direction of the differences was 
predicted, only one-half of the normal prob- 
ability curve was utilized. Probabilities at the 
05 level were considered significant. Even 
though obtained differences for a number of 
the variables did not attain statistical sig- 
nificance, if the differences in an extensive 
series of variables were in the direction of 


TABLE 2 


Cur-Square CATEGORIZATIONS OF RoRsSCHACH 


VARIABLES 








VARIABLES BREAKDOWNS 





M 3-up 
TSH 3-up 
FC o I-up 
CF 3-up 
c o I-up 
M:C 
Coar. 
Ambi. 
Intr. 
Extr. 
Conf. 
Cont. 
Pos. 
Col. Nam 


Absence 
Absence 
Absence 
Absence 
Absence 
Absence 
Absence 
Absence 


Presence 
Presence 
Presence 
Presence 
Presence 
Presence 
Presence 
Presence 
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ComPaRIsON oF REMITTED AND HospPITALizep SCHIZOPHRENICS ON RorsCHACH VARIABLES IN SAMPLES I AND II* 








CR 


. Cut-Soa. 


df 


p 





Cont 


II 
Pos. 
I 
II 
Col. Nam. 
I 
II 


<95 


<oi** 
<20°* 


<30°* 
<30°* 


<< 30°* 
<1o0** 


<o1** 
<10** 


<oi** 
<20** 


<20** 
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<o1** 
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<o1** 
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<o.** 
<os5** 


<“o1** 


<10* 


<oo1** 
<oo1** 


<oo1** 


a *e 
001 


< 


<10** 
<os5** 


<oo1r** 
<oi1** 


<os5** 
<20** 


<10° 
<30°° 


<os** 
<o2** 


<os5** 
<so** 





*N = 92 for Sample I, 56 for Sample Il. 
** Prediction was confirmed. 
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the predictions, this in itself would be con- 
sidered to furnish a basis for rejecting the 
hypothesis that membership in either the 
remitted or hospitalized groups was not re- 
lated to scores on the Rorschach test variables 
for which predictions were made. 

The differences between the remitted and 
the hospitalized groups on the total number 
of responses (R) were so small that it was not 
necessary to make allowances for differences 
in R. 

All 18 predictions for the Rorschach were 
confirmed in both samples. The following 
differences were significant at the .05 level 
in both samples: P, F-+-°%, extended F+-% 
position. The following differences were sig- 
nificant at the .o5 level in only one sample: 
M, CF, C, H, color-naming, confabulation, 
ambiequality, introversiveness, extratensive- 
ness, and W:M ratio. 

It may be concluded from the findings that 
these variables serve to differentiate remitted 
from hospitalized schizophrenics in the di- 
rection that the theory has predicted. In this 
broader sense the null hypothesis may be re- 
jected by virtue of the validation of a series 
of specific predictions referring to the two 
groups. 

Excepting FSH analysis of other Rorschach 
variables for which no predictions were made 
did not yield consistent results for the two 
samples. The remitted group tended to 
show more FSH in both samples. The dif- 
ferences, however, were not significant. 

The Rorschach records in Sample I were 
scored by the Elizur scale for measuring 
anxiety and hostility (3). The remitted group 
scored higher on both variables, the differ- 
ences being significant at the .1-.2 level for 
anxiety, at the .02-.05 level for hostility. 
However, since five cases in the remitted 
group accounted for more than half of the 
total hostility score for that group, the dif- 
ference for this variable may not be as sig- 
nificant as it appears. 


Clinical and Historical Variables 


Ninety-one patients from Sample I were 


rated on go clinical variables. Although a 
four-point rating scale was used, almost all 
the comparisons were on the basis of the 
presence or absence of a variable. The ma- 
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terial for rating was drawn from the clinical 
folders of the patients. The items were 
mainly of a factual, informational nature; for 
instance, reported delusions, hallucinations, 
frequency of heterosexual relations, etc. The 
ratings for all g1 cases were done by the 
writer after preliminary check showed an 
agreement of 95 per cent between raters. 

In comparing the remitted with the hos- 
pitalized group in Sample I, differences sig- 
nificant at the .o5 level were found on only 
seven out of go variables. Since four to five 
such differences could be expected to occur by 
chance, these findings should be interpreted 
with caution. 

The remitted group showed more evidence 
of the following: 


. Drinking 
. Headaches 
3. Vertigo 
. Weakness 
. Enuresis in childhood 


The hospitalized group showed more evi- 
dence of: 


1. Previous electric convulsive therapy 
2. Previous shock treatment of any kind 


As a group schizophrenics reported very 
few cases of psychosomatic illness, few overt 
sexual perversions, and almost no stupor or 
mutism. The patients of both groups failed 
conspicuously to achieve a satisfactory marital 
adjustment. There were few reported in- 
stances of physical separation from parents 
during infancy or childhood in either group. 
While remitted patients tended to report 
more childhood behavior problems, neither 
group was characterized by childhood delin- 
quency to any marked extent. 

On most of the key variables the two 
groups were more similar than divergent. 
The similarities on the clinical and historical 
variables were in sharp contrast with the dif- 
ferences found on the psychological tests. 

One note of caution must be inserted. 
Many significant variables were not measured 
because of lack of information regarding 
them or the lack of some means of rating 
them reliably; for instance, mood and affect 
at the time of admission, family history of 
psychosis, nature of psychological relation- 
ship with the parents, etc. 
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Discussion 


In this study remitted and hospitalized 
schizophrenics were sharply differentiated 
with respect to the length of the present 
hospitalization. The findings suggest that 
the longer a patient remains hospitalized, the 
less favorable are his chances of recovery. 

Remitted and hospitalized schizophrenics 
did not differ significantly with respect to 
age, education, occupation, marital status, re- 
ligious affiliation, diagnostic subtype, and 
previous number of hospitalizations. 

The two groups did not differ significantly 
above chance on a wide variety of clinical 
and historical variables. 

At the time of admission Wechsler-Belle- 
vue I.Q. scores of patients who later remitted 
were higher than those of patients who did 
not subsequently remit. The findings sug- 
gest that the remitted group was slightly 
superior to the hospitalized group in intel- 
lectual functioning at that time. In view 
of the comparatively small differences and 
the large amount of overlap between the 
groups, no definitive statement can be made 
concerning the relationship of I.Q. to remis- 
sion. 

Assuming the validity of Rorschach test 
variables, the Rorschach findings suggest that 
the remitted group showed greater clarity of 
associative processes and a greater ability to 
perceive stimuli in the same manner as nor- 
mals. Remitted patients appeared to have 
richer inner lives as well as greater ability to 
adapt their thinking to that of others. Their 
emotional reactions tended to be less unstable 
and more appropriate than those of nonremit- 
ting patients. They seemed to be better able 
to control or to prevent direct expression of 
impulses. emitted patients appeared to 
have a greater capacity to attain goals they 
set for themselves, i.e., a more realistic level 
of aspiration. As a group they showed fewer 
pathological signs on the Rorschach. In gen- 
eral, at the time of admission the schizo- 
phrenic disturbance appears to have been 
less deep seated in patients who later remit- 
ted than in patients who remained hospital- 
ized. 

Summarizing the above findings, the su- 
periority of the remitted group on Wechsler- 
Bellevue I.Q. scores and on such Rorschach 
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variables as M, F+-°%, P, FC, H, ambiequal- 
ity and introversiveness of experience-balance 
indicates that the remitted patients had 
greater personality resources for dealing with 
their problems and greater ability to continue 
functioning in the face of stress. The greater 
number of CF, C, confabulatory, contami- 
nated, and position responses in the hospital- 
ized group may be indicative of the extent to 
which maladaptive disease processes had pro- 
gressed in these patients even at the time 
of admission. 

In perceptual terms the results may be 
stated as follows: The remitted group gave 
more responses on those variables which ap- 
pear to be indicative of perceptual control, 
while the hospitalized group produced a 
greater number of responses on those vari- 
ables which appear to be related to a loss of 
perceptual control. 

The implication of these findings for the 
treatment of patients is that in a supportive 
situation in which external precipitating 
stresses are to a large extent removed, patients 
with better intellectual and emotional func- 
tioning will eventually recover sufficiently to 
leave the hospital. What these patients seem 
to need for recovery is mainly a favorable 
environmental situation in which the pa- 
tients are helped to regain their health. 

In the hospitalized group lessening of the 
stress in a protective hospital environment 
did not lead to remission. One of the rea- 
sons for this failure may have been the com- 
parative inadequacy or degree of impairment 
of personality functioning of patients in this 
group. While it is not the only factor in re- 
covery, the intactness of the individual’s per- 
sonality is certainly one of the most impor- 
tant. To this writer it is doubtful whether 
shock therapy, environmental manipulation, 
or psychotherapy will be of lasting benefit 
unless the individual has the personality re- 
sources to utilize them. 

There is considerable overlap between 
groups on the tests. This is due in part to 
errors of measurement. However, a good 
deal of overlap may reflect similarities in per- 
sonality makeup among members of the hos- 
pitalized and remitted groups. There are 
probably a considerable number of hospital- 
ized patients who resemble remitted patients 
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more closely than they do patients in their own 
group. Nevertheless, they remain hospital- 
ized. The explanation for their failure to re- 
mit may lie in other factors, such as poor 
home relationships, unfavorable environmen- 
tal situation outside the hospital, improper 
treatment in the hospital, or somatic factors. 
In any event, it seems unlikely that one set 
of factors accounts for all recoveries or fail- 
ures to recover. 


SUMMARY 


1. This study was concerned with the rela- 
tion of certain personality and intellectual 
variables, as measured by the Wechsler-Belle- 
vue and Rorschach tests, to remission in 
schizophrenia. Predictions with respect to 
differences between remitting and nonremit- 
ting schizophrenics on certain test variables 
were derived from the theoretical assump- 
tions underlying these test variables and tested 
on two samples. 

2. On the Wechlsler-Bellevue all three pre- 
dictions concerning differences in I.Q. scores 
were confirmed in both samples. 

3. On the Rorschach all predictions were 
confirmed in both samples. Remitted patients 


gave more M, FC, TSH, P, H responses. 
They showed a more introversive and a more 
ambiequal experience-balance, higher F+-%, 
higher extended F+°%, and a lower W:M 
Hospitalized patients gave more CF, 
C, color-naming, position, confabulatory, and 


ratio. 


contaminated They showed a 


responses, 
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more extratensive and a more coartated ex- 
perience-balance. 

4. The findings indicate that emotional and 
intellectual functioning are less impaired in 
remitting than in nonremitting patients. On 
the whole, test records of remitting schizo- 
phrenics tend more to resemble those of neu- 
rotics than do the test records of hospitalized 
schizophrenics. 
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THE INFLUENCE OF CULTURAL FACTORS ON THE LEITER 
INTERNATIONAL PERFORMANCE SCALE * 
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SYCHOLOGISTs are ever on the lookout for 

tests of general intelligence that will 

give valid and reliable measures uni- 
versally, tests that will minimize essentially 
nonintellectual handicaps (language spoken, 
geographical habitat, social background, etc.) 
yet yield an index of ability to function in- 
telligently—i.e., to think conceptually and ab- 
stractly. The Leiter International Perform- 
ance Scale (LIPS) has purported to be just 
such a tool.’ It claims a freedom from any 
required verbal communication, either oral or 
written, a content based on abstract con- 
cepts, and motor manipulation limited to a 
single simple operation. But we are neces- 
sarily skeptical of the claims of new tests until 
proved. The present research, therefore, set 
for itself the task of evaluating the hypothesis 
underlying the construction of the LIPS, 
namely, that the scale is relatively culture free 
and therefore unaffected by environmental 
influence. 

If the LIPS is more culture free than the 
verbal-type tests, it seemed reasonable to as- 
sume that Leiter test scores would show 
smaller differences between groups selected 
on the basis of cultural differences (socioeco- 
nomic status, preschool attendance, and in- 
stitutional residence) and less variability 
within a sample composed of such groups 
than would test results from a standard 
verbal test, the Stanford-Binet, 1937 Revision, 
Form L. 

Since the LIPS is a performance test, a sec- 
ond aim was to compare the Scale with a 
typical performance test. The Grace Arthur 
Scale of Performance Tests, 1943 Revision, 
Form I, was chusen because it has been used 
for research purposes somewhat more fre- 
quently than have similar scales and because 
it was suitable for the age group considered. 

Previous Studies. Wellman (14) discusses the 


1 This article is based on a doctoral dissertation 
written under the direction of Beth L. Wellman at the 
Child Welfare Research Station, State University of 
Iowa. 
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meaning of environment rather comprehensively, 
pointing out that each experimenter defines “en- 
vironment” specifically for his research, narrowly or 
broadly, according to his purposes. From her list of 
“major environmental constellations,” four factors 
were selected for control: socioeconomic status, edu- 
cation of the parents, preschool attendance, and 
institutional residence, primarily because previous 
research with the Stanford-Binet and the Arthur 
Scale had yielded fairly definite findings in these 
areas. ' 

Neff (11) reviews the numerous studies regarding 
the relationship between socioeconomic status and 
development of intelligence, and concludes that chil- 
dren of the lowest socioeconomic classes test about 
twerty points lower on the Stanford-Binet than chil- 
dren of the professional classes. Wellman (15) 
summarizes the various studies relating to the effect 
of preschool attendance upon intellectual develop- 
ment. Although such studies are not in complete 
agreement, the majority report an increase in IQ 
accompanying preschool attendance. 

Skeels (12), summarizing the Iowa studies which 
have found that institutional residence tends to de- 
press the intellectual development of children, 
quotes a mean IQ of 89.7 for 42 five-year-olds in 
residence in the Iowa Soldiers’ Orphans’ Home. 
This same orphanage, under a changed name, fur- 
nished the orphanage group of the present research. 

Research employing the LIPS has been, to date, 
primarily concerned with other cultural groups. 
Craig (3), Molino (10), and Michea (9) have stud- 
ied the responses of Mexican children on the LIPS. 
Darby (4) and Goulard (5) employed the Scale in 
studying Japanese Americans. As a result of 
Madeley’s (8) research, the upper levels of the LIPS 
were revised to their present form. 

Until the recent restandardization work by Arthur 
(2) and the present research, very little had been 
done with the LIPS at the lower age levels. Wil- 
liams (16) did a comparative study between the 
Stanford Binet, L, and the LIPS on fifty children 
ranging from six to ten years, inclusive. 

Subjects. One hundred and eight children, 53 
boys and 55 girls, ranging in age from four’ years 
and eleven months to six years, were tested during 
this research. Ninety of the children were residents 
of a midwestern town, Iowa City, Iowa. The oth- 
ers were resident in an orphanage sixty miles away. 

Group 1, the professional preschool group (here- 
after designated by the abbreviation PP) was com- 
posed of children enrolled in the Preschool Labora- 
tories of the State University of Iowa Child Wel- 
fare Research Station from one to thirteen semesters 
(including summer sessions); only four of the 
thirty had had fewer than three semesters’ experi- 
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ence. Fathers of these preschool children were all 
either of the professional or managerial occupational 
groups. Half held either a PhD or an MD degree, 
the mean educational level being 17.8 years (the 
range 15 to 19 years). The mothers of the PP 
group averaged 15.5 years of education, with 17 
holding bachelor’s degrees and an additional five 
with graduate degrees. 

Groups 2 and 3, the kindergarten professional 
group (KP) and the kindergarten lower socioeco- 
nomic group (KL), respectively, were drawn from 
the kindergarten enrollment in the public schools 
of Iowa City. The occupational and educational 
level of the parents of the KP group approximate 
those of the PP group. The fathers of the KL 
group ranged in occupational level from skilled 
tradesman (e.g., mechanic) down to day laborer, 
unskilled. 

Group 4, the orphanage group (OL), consists of 
eighteen children resident in the state orphanage, 
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Testing procedure. The tests were administered 
in surroundings familiar to the child and as 
physically close to his classroom as possible with- 
out violating good testing procedure. Only one 
test was given in any half-day session. The in- 
terval between administration of the first and third 
tests ranged from two days to fourteen days (except 
for one 26-day interval in the OL group), with 
a mode of three days and a mean of 4.96 days.” 

The tests were administered in all six of the pos- 
sible sequences. Subjects, within their respective 
groups, were randomly assigned to these in such a 
way that five children in each of the larger experi- 
mental groups received the tests in any given order. 
With the OL group, of course, the number was 
not sufficient to do this, and the subjects were as- 
signed to the sequences in a random fashion. 

The Arthur Scale was administered in an abridged 
form, omitting the Casuist Formboard and the Mani- 
kin-Feature-Profile Test, in accordance with Arthur’s 


TABLE | 


ANALYSIS OF VARIANCE 


Sourcr DEGREES OF 


FREEDOM 


SuM oF SQUARES MEAN SQuaRE 





Socioecon. Groups 
Tests 

Interaction (T x G) 
Within cells 

Total 


18868.24 
13799.52 
138.98 
274.31 


56604.72 
27599.05 

833.87 
85037.64 





@? tests 
=—>- -=50.31 df 2, 

o? w.cells 

2 group 
F= <. a 

o* W.cclus 

the Iowa Annie Wittenmyer Home, at Davenport, 
Iowa. These children are not, for the most part, 
orphans in the usual sense, but are primarily neg- 
lected and dependent children placed in t.e Home 
by court order. They come from the poores: homes 
in the state, representing parental occupations rang- 
ing downward from semiskilled labor to unem- 
ployed. The educational level of the parents in the 
KL group is significantly higher statistically than 
that of the parents in the OL group. 

The subjects were selected according to predeter- 
mined criteria. The PP and OL groups included 
every child available meeting the criteria. The two 
kindergarten groups were filled by taking every 
child who met the criteria in a given class before 
sampling a new one. Usually testing progressed 
concurrently in two kindergarten classes, because 
sessions are for a half-day only. Subjects were drawn 
from seven different classes in five schools. These 
K group children were selected, first, according to 
parental occupation and, second, in order to obtain 
a balanced sex ratio and age range. 

Two subjects were discarded from the KP group 
because in one the family moved out of town and 
in the other the child became ill. Test data are 
incomplete for two other subjects in the KP group; 
in both, Grace Arthur scores are missing owing to 
illness before the Christmas holiday. 


310; significant beyond . 


68.78 df 3, 310; significant beyond . 


o1 level of confidence. 
o1 level of confidence. 


recommendations for this abridgment (1, pp. 58-63). 
Although Leiter (6) recommends that his test be 
given without verbal intercourse during the adminis- 
tration, to avoid effecting an artificial situation which 
might depress test performance, the examiner 
neither encouraged the child to talk nor expressly 
forbade his doing so; however, when help from the 
examiner was solicited, the standard answers, “You 
do it the way you think it should go,” or, more 
explicitly, “The game is for you to decide what to 
do. Go ahead the way you want to do it,” were 
the only acceptable ones. From a clinical point of 
view, it was interesting to note the extreme varia- 
tion in the apparent need of the child to maintain 
verbal contact with the examiner. In all other re- 
spects, an attempt was made to adhere to the letter 
of the author's directions for administration. 


RESULTS 


Table 1 presents an analysis of variance 
when all the test scores on all the groups are 


2 These intervals include both the first and last testing 
day, not the difference between dates. Thus, test 
administration over November 2, 3, and 4 is classified 
as three days. 
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considered together. Because the interaction 
variance appears unstable, being by chance 
smaller than the variance within cells, the 
latter was used as the error term in comput- 
ing the F-tests based on this table. The vari- 
ance between groups is significantly greater 
than chance as one expects it to be, inasmuch 
as the groups were selected according to ex- 
tremes of variables known to be related to 
some of the measures employed. This F-test 
merely demonstrates that the hypothesis that 
the groups are random samples from a single 


In outlining the problem it was postulated 
further that, if the LIPS were relatively cul- 
ture free, the variability of Leiter scores 
within a sample composed of extreme socio- 
economic groups would be less than that of 
the Stanford-Binet scores on the same sample. 
Accordingly, independent variances were 
computed and the F-test between the sum of 
squares SB-L (446.37) and the sum of squares 
LIPS (444.45) was made (df 107 and 107). 
Although the variance for the Stanford-Binet 
was greater than that for the Leiter, the dif- 


TABLE 2 


Rance, MEANs, STANDARD DeviaTIOoNs AND STANDARD Errors OF THE MEANS OF PERFORMANCE AND 


INTELLIGENCE QUOTIENTS FOR 


THE EXPERIMENTAL GROUPS 








PP 
(N=30) 


EXPERIMENTAL GROUPS 
KL Tora 
(N=30) (N=108) 





Stanford-Binet, L 
Range 110--166 
Mean I.Q. 128.6 
sD 11.7 
ou 2.1 


Arthur 
Range 81-172 
Mean P.Q. 121.0 
SD 18.9 14-5 
ou 3-4 2.6 
Leiter (orig.) 
Range 81-124 


Mean I.Q. 105.97 


SD 10.4 
ou . 1.9 


Leiter (rev.) 
Range 77-148 87-129 
Mean I.Q. 110.8 110.9 
SD 17.5 10.1 


87-140 69-166 
105.1 114.3 
13.5 18.8 
2.5 ° 1.8 


64-132 58-172 
109.8 112.2 
18.4 20.4 
3.4 4-2 1.96 


64-114 58-94 58-136 
82.40 73.2 93.8 

13.4 11.3 19.1 
2.4 2.7 1.8 


71-118 65-100 65-148 
89.0 80.3 99-7 
12.7 11.0 18.2 





* Two cases of the KP group lacked data on the Arthur Performance Scale; accordingly, the N and df 


were 28 and 56 or 44, respectively, in comparisons where 


population can be rejected with great confi- 
dence. 

The mean scores for the experimental 
groups on the three tests, the Stanford-Binet, 
Arthur Performance Scale, and the Leiter 
Scale, are presented in Table 2. The ¢test 
was applied to these means, comparing each 
group with every other group. All three tests 
differentiated between all groups except that 
of the PP to KP comparison. None of the 
tests differentiated significantly between the 
two professional groups. The expectation 
that the LIPS would show smaller differ- 
ences between the experimental groups was 


not fulfilled. 


the Arthur KP scores were considered. 


ference was not significantly greater than 
chance. On the other hand, the variance 
of the Arthur Scale scores for the same 
sample (295.72) was smaller than that of 
the LIPS scores, and significantly so at the 
5 per cent level of confidence. (F=1.503, 
df 107 and 107). 

Since the variance for tests was significantly 
larger than chance, it was appropriate to test 
the significance of the difference between the 
mean IQ obtained on the Stanford-Binet and 
the mean IQ obtained on the Leiter for the 
entire sample. The ¢-tests between test means 
were computed similarly for each pair of tests 
in each experimental group and for the total 
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sample. The ¢ between the mean Stanford- 
Binet IQ and the mean Leiter IQ was sig- 
nificant beyond the 1 per cent level of con- 
fidence, being approximately six times as 
large as the required ¢ for 213 degrees of free- 
dom. Thus, the statistical test confirms the 
empirical observation that the intelligence 
quotients yielded by the two tests cannot be 
equally valid measures of the same variable. 

Similarly, the differences between the IQ's 
on the Leiter and on the Arthur, both with 
respect to the experimental groups separately 
and to the sample as a whole, were statis- 
tically significant beyond the 1 per cent 
level of confidence. But, except for the PP 
group, the differences between the Arthur 
and Stanford-Binet results were negligible. 

The order and magnitude of the mean 
IQ’s on the Stanford-Binets for the groups 
were about that anticipated from previous re- 
search, i.¢., the preschool group averages 4.5 
IQ points higher than their peers from com- 
parable home backgrounds. Although this 
difference in means is not statistically sig- 
nificant, the advantage is in the expected di- 
rection. The KL groups, representing a mid- 
dle to lower middle class environment, test at 
a mean average level of intelligence on the 
Binet. The mean IQ of 90 for the orphan- 
age group is consistent with previous studies 
of children from the same environment. 

The findings with respect to the Arthur 
Performance Scale I, Abridged, again indi- 
cate that the two professional groups are 
almost identical, with the lower two groups 
obtaining means of the same order and mag- 
nitude as the Stanford-Binet results. Be- 
tween these two tests, the Arthur and the 
3inet, the only statistically significant differ- 
ence is that in the PP group; for this group 
at least, the two tests apparently measure dif- 
ferent abilities. 

The measures of dispersion and central 
tendency with respect to the LIPS presented 
in Table 2 are expressed in terms of two 
scoring systems, those set forth by Leiter 
in the manual for administering the test and 
that scoring system advised by Arthur (2) 
on the basis of a restandardization of the 
test on “middle class American” children 
from a midwestern community. The Arthur 
standardization group consisted of 289 chil- 
dren, of whom 56 were between 5.0 and 5.99 
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years. She describes her group as mainly 
coming from homes classified occupationally 
as skilled and semiskilled labor. Unfortu- 
nately, Arthur does not report mental ages, 
1Q’s, or statistics on the Leiter, except as 
combined with the Revised Form II of the 
Arthur Point Scale of Performance Tests. 

Because Arthur’s report was not published 
until after much of the testing had been com- 
pleted, it was impractical to initiate her pre- 
scribed changes in the testing procedure. The 
test records were rescored, however, accord- 
ing to the following: 

1. To basal mental ages at the II and 
III year levels, six months’ credit was added. 

2. To basal mental ages at or above the 
IV year level, three months’ credit was 
added. 

3. If a basal mental age was obtained at II 
or III, tests passed at the four-year level were 
credited with 2.25 months instead of 3 
months each. 

Even with the revised scoring, the mean 
IQ for the sample in this study is 14.8 points 
below that obtained on the Stanford-Binet. 
The discrepancy between the mean IQ’s (by 
the original scoring system) for the sample 
of 108 is substantially greater than the dis- 
crepancy of 11.8 IQ points in favor of the 
Stanford-Binet reported by Williams (16), 
and is contradictory to Williams’ observa- 
tion that the test items at these younger 
age levels were more nearly alike on the 
two tests. Although the Arthur scoring raises 
the IQ’s, the differences between the tests 
remain too large to be attributed to chance 
alone. The discrepancy between the revised 
LIPS IQ and mean Stanford-Binet IQ was 
9.7 for the OL group, 16.1 for the KL group, 
13.1 for the KP group and 17.7 for the PP 
group. 

Pearson product-moment correlations be- 
tween the three tests were computed accord- 
ing to both scoring systems of the LIPS. As 
anticipated, our group yielded higher correla- 
tions between the Binet and the LIPS (++.77 
and +.81, for original and revised scores, 
respectively) than that reported by Williams 
(+.67, original scoring). This finding may 
be due, as she hypothesized, to the abilities 
tested being less specialized, and the Binet 
items, at the lower levels, being more similar 
to the content of the LIPS. It might also be 
due to a larger number of cases, a more 
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limited chronological age range, and a wider 
range of obtained mental ages. 

We have already indicated that the Leiter 
and the Stanford-Binet cannot be regarded 
as equally valid measures of the same vari- 
able, if one considers only the 1Q’s. But the 
relatively high Pearson product-moment cor- 
relation (-+-.81) suggests that if test results 
were expressed in comparable terms, such as 
standard scores, the two tests would indeed 
be measures of the same variable. Similarly, 
the correlations between the Grace Arthur 
and the Leiter (+.80) and between the 
Arthur and the Binet (+.75) indicate that 
these tests are, at least in part, testing the 
same thing. 

In order to estimate the reliability of the 
Leiter Scale, Hoyt’s analysis of variance pro- 
cedure, as outlined by Thorndike (13), was 
employed. The coefficient or index obtained, 
based upon the consistency of individual per- 
formance upon test items, was +-.75. The 
split-half method of estimating reliability 
usually results in a higher index, according to 


Thorndike. 
SUMMARY 


In an attempt to evaluate the relative free- 
dom from cultural influence of the Leiter 
International Performance Scale, that test and 
two other tests of intelligence, the Stanford- 
Binet, Form L, and the Grace Arthur Scale 
of Performance Tests, were administered to 
108 five-year-olds. These subjects, 53 boys 
and 55 girls, constituted four distinct groups: 
an upper socioeconomic group with preschool 
experience, an upper socioeconomic group en- 
rolled in kindergarten with no preschool ex- 
perience, a lower socioeconomic group en- 
rolled in kindergarten with no preschool ex- 
perience, and a lower socioeconomic group 
from a state orphanage. 

The variance of all test scores for all the 
groups was analyzed, and F- and ¢-tests were 
made of the differences between the test 
means for the experimental groups. 

The main findings were: (a) All tests dif- 
ferentiated significantly between all pairs of 
the experimental groups, except between the 
two professional groups; (4) the LIPS ap- 
peared no more culture free than either the 
Binet or the Arthur; (c) the LIPS means 
for all groups Were consistently smaller than 
those of either the Arthur or the Binet with 


the magnitude of means on the latter two 
tests approximately equal for any given 
group; (d) the LIPS correlated as highly 
with each of the other tests as those tests 
did with each other. 

The general conclusion is that the LIPS, 
though probably no more free of cultural in- 
fluence than the Binet or the Arthur, is a 
valid, useful instrument for measuring intelli- 
gence at the preschool level, but that it seri- 
ously needs a restandardization or revision of 
published norms, notwithstanding the recent 


work of Arthur. 
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PROGNOSTIC INDICES IN THE RORSCHACH RECORDS OF 
HOSPITALIZED PATIENTS * 


GORDON FILMER-BENNETT 
Worcester State Hospital 


NvesticaTions of the Rorschach records of 
behaviorally disturbed individuals have 
revealed specific variations in performance 

which appear to correlate highly with treat- 
ment-outcome. Among studies of psychotic 
records (8, 12, 13, 15) there has been general 
agreement concerning the predictive value of 
color responses, with more limited acceptance 
of such other pertinent indices as movement 
responses, achromatic responses, and the tend- 
ency to withhold doubtful interpretations. 
Findings obtained from the Rorschachs of 
nonpsychotic hospitalized patients (3, 4, 6, 
9, 17) reveal some similarities to the fore- 
going in the form of an emphasis upon color, 
shading, or movement responses. Other fac- 
tors correlating with the outcome of treat- 
ment which have appeared in studies of both 
psychotics and nonpsychotics include card 
rejections, location responses, and nature of 
content. 

Since Rorschach indices associated with 
psychotics were also found in some instances 
among the nonpsychotics, the question arises 
as to whether there are reactional tendencies 
common to both the psychotic and nonpsy- 
chotic states which differentiate those who 
later improve from those who fail to improve. 
If the Rorschach be an appropriate medium 
for the expression of such behavior tenden- 
cies, then a consideration of the protocol 
should provide the answer. 

The purpose of the present study was to 
examine pretreatment Rorschach records with 
a view to isolating performance variables 
which may distinguish the improved from 
the unimproved, regardless of psychiatric 
classification. 


SuBJECTS 
The pretreatment Rorschachs examined were those 
of 78 inpatients admitted to the University of 


1 Based on a doctoral dissertation submitted to the 
Graduate School of the University of Pittsburgh. Grateful 
acknowl is given Dr. Gerald R. Pascal under 
research was conducted. Special 
Dr. George Albee who assisted in 


dgement 
guidance the 
thank ir also duc 
the validation 


whose 


study. 


Pittsburgh’s Western Psychiatric Institute prior to 
July 1949. Of these patients, 55 were diagnosed as 
psychotic and 23 as nonpsychotic, all without 
organic involvement. The psychotics included 30 
schizophrenics, 15 manic-depressives, 5 involutionals, 
and 5 other types, while the nonpsychotics included 
14 psychoneurotics, 4 psychopaths, 3 primary be- 
havior disorders, and 2 alcoholics. Treatment dur- 
ing the period of hospitalization drew upon 
the usual physical therapies, such as metrazol, 
insulin, and electroshock, as well as upon psy- 
chotherapy. Some patients received shock treat- 
ment alone, some psychotherapy alone, while several 
underwent intensive therapy combining both forms 
of treatment. Several others were accorded no 
specialized treatment, but instead were subjected 
to more routine procedures, including occupational 
therapy and hydrotherapy. A number of patients 
were included who had been treated before enter- 
ing the Institute, the condition being that such 
treatment must have been terminated at least 
cight weeks prior to psychological testing. 

Follow-up data concerning the patient's post- 
treatment adjustment one year. or more after his 
departure from the hospital were obtained through 
the hospital agency, social worker, or therapist in 
charge, evaluation of which resulted in a total of 
22 improved, 21 moderately improved, and 35 un- 
improved patients. Average date of follow-up was 
slightly more than two years beyond the period of 
hospitalization. 

A patient was considered improved if the 
follow-up indicated that he was able to get along 
continuously with others and to carry on in a 
suitable full-time occupation, preferably that in 
which he was previously engaged. In most in- 
stances the patient’s social environment was identical 
with that existing previously. Moderate improve- 
ment signified a continuing mild disturbance of 
social relationships, but a satisfactory vocational 
adjustment, or else a relatively good social adjust- 
ment but difficulty in handling vocational problems. 
Lack of improvement referred to continued hospi- 
talization, following a return to the Institute or 
transfer to another mental institution. Also included 
within this category were all whose behavior was 
essentially unchanged, who were unemployed, but 
who at the same time remained outside the hospital 
owing to a modified attitude on the part of rela- 
tives or because of a reduction of previous demands 
upon the patient. 


PROCEDURE 


The present investigation was comprised of (a) 
an exploratory study which attempted to derive 


certain Rorschach variables correlated with treat- 
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ment-outcome, and (4) a validation study con- 
cerned with testing these variables. 

In setting up the exploratory groups, 12 improved 
patients were individually matched with 12 unim- 
proved in terms of the following variables: age, sex, 
marital status, educational level, extent of prior 
hospitalization, nature of treatment received during 
prior hospitalization (whether shock or nonshock), 
nature of present treatment, and the psychotic 
versus nonpsychotic composition of the groups. 
Mean chronological ages for the improved and 
unimproved groups were respectively 33.2 (SD 10.0) 
and 34.5 (SD 8.8) years, while corresponding 
mean IQ’s were 100.3 (SD 12.2) and 101.6 (SD 
12.2). Probability values deriving from a #test 
of the differences between the two groups in re- 
spect to both these variables exceeded .10. Also, 
chi-square evaluation of differences governing the 
remaining list of variables yielded probabilities 
greater than .20. 

A comparison of the two exploratory groups was 
then made with respect to differences in their pre- 
treatment Rorschach responses. First to be checked 
were some of the more common prognostic signs 
cited in the literature, including diffcrences relating 
to productivity, location, movement, color and 
shading, human and popular content, among others. 
Next came an examination of such differences in 
Rorschach behavior as were peculiar to our two 
groups, as well as possible differences in Rorschach 
patterns depicting anxiety (7, 10, 16), homosexu- 
ality (18), amount and direction of aggression (1), 
attitudinal changes associated with movement re- 
sponses (14), and extent of depression (10, 16). 
3eck’s standards (2) provided the basis for scoring, 
except for a supplementary scoring of the deter- 
minants according to Klopfer (11). Selection of 
the most promising variables from the exploratory 
groups was by way of the chi-square test, the 
resulting probability values serving solely as screen- 
ing measures of relative rather than absolute sig- 
nificance. Only those variables differentiating to 
the extent of a probability of .20 or less were ac- 
cepted. 

The selected variables were next applied to the 
test groups in a “blind” sorting of their pretreat- 
ment Rorschachs by a qualified psychologist, the 
records being randomly arranged as regards treat- 
ment outcome and identified only by code numbers. 

There were three test groups comprised of 10 
improved, 21 moderately improved, and 23 unim- 
proved patients; these constituted the remainder 
of the pool from which the exploratory groups 
were drawn. Differences pertaining to these groups 
were negligible except as regards marital status, 
nature of prior therapy, and sex. Respective proba- 
bility values describing these differences among the 
test groups were .o1, .05, and .o9. 


RESULTS 


Exploratory study. Of the more than 60 
Rorschach signs examined, only nine were 
found to differentiate between the two ex- 


593 


ploratory groups to the extent of a chi-square 
probability of .20 or better. The five signs 
associated with a favorable prognosis, in order 
from most to least significant, included (a) 
a relatively greater number of above-the- 
median responses to the three all-color cards, 
(4) card criticism, (¢) seven or fewer content 
categories, (¢) H, (human-gender responses 
comprising 15 per cent or more of the total 
R), and (e) M>(FM+m). The four 
unfavorable signs in corresponding order of 
importance were (a) K-type responses (K, 
KF, FK), (6) sum of chiaroscura responses ” 
greater than 0.5, (¢) W-repetition (two or 
more whole [W] location responses to any 
one card), and (d@) color “drive” score (13) 
greater than zero. 

None of the foregoing variables by them- 
selves was sufficiently discriminating to war- 
rant checking their validity against the test 
groups. But selection based upon a constella- 
tion of signs yielded more promising indices. 
Thus, M>(FM-+m) or card criticism or 
H,, unaccompanied by either W-repetition or 
K-type responses, portended a favorable re- 
sponse to treatment; on the other hand, 
W-repetition or K-type responses, regardless 
of the presence of any of the individual 
favorable signs, pointed to an unfavorable 
outcome. Respective probability values cov- 
ering these two sign constellations were .004 
and .oog. But again, these values were re- 
garded merely as screening measures, and 
cannot be taken at face value because of the 
inflationary effect resulting from the large 
number of comparisons involved, both actual 
and implied. 

Further analysis failed to discover any 
other signs bearing a relationship to treat- 
ment outcome. This was true of the separate 
reactions to Cards IV, VI, and VII, the so- 
called parent and sex cards, as well as of 
the Rorschach patterns symbolizing anxiety, 
homosexuality, depression, etc. None of the 
latter yielded probabilities better than .38. 

A comparison of the sixteen psychotic and 
eight nonpsychotic records in the two ex- 
ploratory groups revealed different prognostic 
trends among these sub-groups for some indi- 
vidual signs. For example, M>(FM-+-m) 


2 Sum computed on basis that form-dominant grays= 
0.§, grays dominant over form=1.0, and pure grays= 
1.5. All C’ responses were excluded. 
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and H, each showed a “marked” relation- 
ship * to treatment outcome_among the psy- 
chotics, but no relationship among the non- 
psychotics.* However, the two aforementioned 
constellations showed consistent and 
“marked” relationships to treatment outcome 
among both psychotics and nonpsychotics, 
while the only individual signs conforming 
in this respect were W-repetition, card criti- 
cism, and increased responses to the all-color 
cards. 

Validation study. The two most promising 
sign constellations were next applied to the 
54 Rorschach records comprising the test 
groups. That is, the rule set up to test 
empirically the validity of these criteria was 


sign 
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the sign constellation associated with favor- 
able outcome was held to be representative 
of both an improved and a moderately im- 
proved state. 

“Blind” predictions utilizing these criteria 
were about as accurate as psychiatric pre- 
dictions made shortly after hospital admis- 
sion (Table 1), over-all accuracy being re- 
spectively 43 and 49 per cent of the maximum 
possible. These results allowed for a liberal 
translation of psychiatric terms used in label- 
ling prognoses, the term “good” equally de- 
fining a prognosis of improvement and of 
moderate improvement. To have adhered 
strictly to definition or to have disregarded 
the so-called “guarded” prognoses altogether, 


TABLE |! 


ACCURACY OF THE Best 


THE 


RorscHacu SiN CoNnsTELLATIONS AND OF PsycHIATRIC JUDGMENTS IN RELATION TO 
Test Groups 


Number and Per Cent of Correct Predictions 


I predictions 
correct rschach predictions 
chiatric predictions 
*=unimproved) 
correct psychiatric predictions 


(“Guarded”=mod. improved) 


correct p 


(“Guarde 


* Psychiatric prognoses ot available for several 
as follows: A record containing one or more 
of M>(FM-+m), card criticism, or H,, 
but devoid of W-repetition or K-type re- 
sponses, points to a good prognosis; on the 
other hand, a record containing either of the 
two latter signs, regardless of the presence 
of any of the other signs, points to a poor 
For present purposes the term 
“good” prognosis was extended to include 


all degrees of improvement. Accordingly, 


prognosis. 


rds containing t 

in the other 

while falling short of the criterion 
evidence of a rever rognostic 
een the psychotics and nonpsychotics. 
j extensor M, (6) ex 
ness of generic rela 
Among the psychotics 
relationship to 
unimprovement Among 
howed a “marked” rela 


e direction 


rela- 


resence of 
} aware 


g bore a “marked” 
the fourth to 


chotics ¢ac h sign s 
the ' ’ j 


MopERATELY 


IMPROVED UniwpRre 


patients in these two groups. 


rather than attribute to them the separate 
meanings of moderate improvement and un- 
improvement, would have been to reduce the 
psychiatric predictions by as much as 35 per 
cent. 

In setting up our original hypotheses there 
were insufhcient data at hand for determining 
just how to classify records which showed 
an absence of any of the five sign components 
making up the two prognostic indices. The 
only record associated with the exploratory 
study which fell into this category belonged 
to the improved group. If, on the strength 
of such meagre evidence, the eight such rec- 
ords in the test groups were classified as in- 
dicating a favorable outcome, then the ac- 
curacy of Rorschach prediction would have 
been increased to 57 per cent. But even ad- 
mitting such unreliable data as evidence, 
the resulting difference in accuracy between 
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Rorschach and psychiatric prediction as herein 
defined is negligible. 

Table 2 indicates that had individual Ror- 
schach sign components formed the basis 
for predictions, then in some instances a 
degree of accuracy somewhat exceeding psy- 
chiatric predictions would have resulted. But 
it may be noted that the majority of such 
sign components functioned in a negative 
rather than in a positive sense inasmuch as 
the most accurate prediction of unimprove- 
ment was achieved by way of the favorable 
sign. Likewise, improvement was best pre- 
dicted by use of the unfavorable sign. 

Probabilities in excess of .30 described the 
interrelationship of successful prediction to 


existence of prognostic indices common to a 
variety of psychiatric disorders. On this 
basis alone a difference in findings would 
not have been surprising. 

Again, there is the matter of follow-up pro- 
cedure. Less than half the studies listed 
earlier sought to evaluate the patient’s prog- 
ress in terms of his later adjustment outside 
the hospital, and of these only two mentioned 
follow-up interviews being conducted as late 
as a year after treatment. The remaining 
studies gauged improvement solely in terms 
of how the patient reacted within the re- 
stricted hospital setting upon termination of 
treatment. On the average, follow-up status 
in the present study was determined about 


TABLE 2 


AccuRACY OF THE SIGN ComMPoNENTs WHEN Usep Sincty IN RELATION To THE Test Groups 


Number and Per Cent of Correct Predictions 


IMPROVED 





: 

No. of possible predictions 10 
correct, using M>(FM+m) or 
Hg or card criticism 

No. correct, using W-repetition or 
K-type responses 

No. correct, using M>(FM+m) I 

No. correct, using Hg 2 

No. correct, using card criticism 3 


No. correct, using W-repetition 7 (70%) 13 (62%) 


No. correct, using K-type responses 


5 (50%) 


6 (60%) 
(10%) 





MopERATELY 


IMPROVED UNIMPROVED 


21 23 
1r (48%) 


14 (61%) 
18 (78%) 
14 (61%) 
16 (70%) 
12 (52%) 
12 (52%) 


11 (52% 


rr (52%) 
5 (24%) 
3 (14%) 
7 (33%) 


17 (81%) 





those variables which were previously found 
to differentiate significantly between the test 
groups, namely, marital status, sex, and 
presence or absence of prior shock treatment. 
It would therefore appear that these variables 
played a negligible role in Rorschach and 
psychiatric predictions. 


Discussion 

The foregoing results failed to reveal any 
sharp trends in Rorschach performance which 
were significantly related to outcome of treat- 
ment or which compared with those reported 
in earlier studies. There are several possible 
explanations for this discrepancy. 

Most previous studies in this area have been 
restricted to either psychotics or nonpsy- 
chotics, or even to specific subtypes falling 
within these two general categories, whereas 
the present study set out to determine the 


two years following the end of the hospitali- 
zation period rather than merely following 
the end of the treatment period. That is, 
in the present instance evaluation rested upon 
the patient’s reaction to multiple environ- 
mental stimuli, many of which were apt to 
have been closely associated with the precipi- 
tation of breakdown, whereas in the case of 
progress notes pertaining to hospital adjust- 
ment consideration was restricted to behavior 
which was isolated from most such disturb- 
ing stimuli. Add to this fact differences in 
the time element involved, and it would seem 
that prognoses associated with these two 
distinct sets of conditions are not exactly 
comparable. It may be that immediate re- 
sponse to treatment is to be distinguished 
from long-term outcome. 

Another point for consideration is the fact 
of a difference in the statistical handling of 
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the data. Apart from the fact that some of 
the studies reported attempted to take into 
account individual performance by applying 
chi-square and other suitable measures, all 
failed to apply the rigorous treatment pre- 
scribed by Cronbach (5). Then again, some 
quoted probability values to which they as- 
cribed a significance which was wholly un- 
justified in view of the manner of derivation. 
The present study sought to avoid some of 
these errors by applying the appropriate pro- 
cedures. 

Finally, several of the studies were confined 
to the setting up of hypotheses relative to Ror- 
schach prediction, but without attempting to 
test their validity against a second sample. 

While some of the foregoing considerations 
may be overshadowed by real differences 
governing the reliability and representative- 
ness of the present and previous samples, 
this is a matter to be decided by further re- 
search. 


SUMMARY AND CONCLUSIONS 
With the pretreatment Rorschachs of 78 
hospitalized patients as data, this study in- 
vestigated possible variations in performance 
which might correlate with outcome of treat- 
ment and be at the same time independent 
of psychiatric classification. Prognostic in- 
dices were derived from an exploratory study 
which compared the Rorschach records of 
an improved and an unimproved group of 
12 patients each, matched in pairs. These 
indices were then tested against a total of 
54 records belonging to improved, moderately 
improved, and unimproved patients. 
The results support the following conclu- 
sions: 


1. There are no prognostic signs common to 
the Rorschach records of both psychotics and 
nonpsychotics which have the predictive ac- 
curacy claimed for the Rorschach signs re- 


ported in earlier studies. 


2. Rorschach prediction is about as accurate 
as psychiatric prediction. 

3. Discrepancies between present and past 
findings may be stated in terms of differences 
in the kind of sample used, in the extent 
and nature of the follow-up, and in the 
statistical handling of the data. 


Gorvon FitMer-BENNETT 


These conclusions do not deny the possible 
prognostic value of certain subtle intrarecord 
patterning which may be at present unsus- 
ceptible to quantitative validation procedures 
and discernible only to the highly skilled 
clinician. 
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THE USE OF A RATING SCALE OF LEVEL OF RESPONSE IN 
TAT STORIES 


DOROTHY TERRY 


University of North Carolina* 


the degree to which the stimuli elicit 

characteristic responses from the sub- 
ject. It seems plausible to assume that the 
projective value of a technique such as the 
TAT is a function, at least in part, of the 
subject’s degree of personal involvement. 
Coleman (1), in 1947, investigated levels of 
response in children’s TAT protocols. His 
different levels included static enumeration 
of objects and people in the picture, overt 
description of the card, and _ interpretation 
of feelings of people in the story. Differences 
were found in the average levels of response 
to the individual cards as well as among the 
subjects. 

This study presents the development of a 
somewhat similar rating scale of level of 
response in TAT productions of college 
women. Level of response is defined as the 
degree to which the subject is judged to have 
become involved in his story, as expressed 
through elaboration of plot and descriptions 
of emotional responses of the story characters. 
The purpose of the study was (a) to es- 
tablish a reliable, empirically derived scale 
and (4) to use this scale to investigate dif- 
ferences between written and oral test pres- 
entation, differences in the TAT cards, and 
individual differences among subjects. 


Ts value of a projective technique lies in 


DerIVATION OF THE LEVEL oF REsPONSE SCALE 


The rating scale method is particularly 
well suited to research with projective ma- 
terial, because it provides a quantitative 
measure of the rater’s clinical judgment of 
the material. The actual rating, or mark on 
the scale, may represent as subtle, holistic 
considerations as we may wish, as long as 
the criteria are explicit. Rating scales have 
often been used in studies of TAT produc- 
tion (1, 4, 5, 6), but the scales generally have 
been made up with predetermined categories. 

1 This paper represents part of a doctoral dissertation 


completed in 1950 at the University of Wisconsin under 
the supervision of Professor Ann Magaret. 


The resulting data, then, are discrete, and 
are not amenable to statistical manipulations 
assuming continuity of data. This disad- 
vantage has been avoided here by using an 
empirically derived scale where the original 
ratings are made on a continuum with no 
subdivisions. 

The first step was to give the 20 cards of 
the women’s series to 10 women sophomores 
taking the elementary psychology course at 
the University of Wisconsin. The cards were 
given in one session of one and a half to two 
hours rather than in the two sessions recom- 
mended (7). Five of the subjects wrote their 
stories, and five gave them orally. The 200 
stories obtained from these subjects were each 
rated on a 10-centimeter line with no sub- 
divisions.” Stories rated at the low extreme 
of the line involved only descriptions of the 
card, while the upper extreme for the group 
included discussions of motivation and anal- 
ysis of emotional responses of characters in 
the stories. Each story for each subject was 
rated on a separate line, and all of the stories 
for one card were rated before progressing 
to the next card. 

After the stories had been rated, they were 
grouped in five intervals, with each interval 
representing a two-centimeter segment of 
the line. Each level of response is described 
in terms of the stories which fall within that 
interval. Quotations from typical stories are 
included for purposes of further illustration. 
The placing of a story in a particular category 
is determined by the highest category which 
it fits: ie, if a story fits the description for 
both categories two and three, it would be 
placed in three. 

1. Description of people and objects in the 
picture. In present tense, no mention of past or 
future, and no mention of things or people not 
in picture, or of facial expression or feelings. 

2. Plot. Some mention of occurrences in the 
past or future (“She probably heard something and 


2 This same procedure was used by Eron, Terry, 
and Callahan (3) in deriving another rating scale for 
the TAT. 
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is running”). Story goes beyond picture in mention- 
ing events, objects, or people not included on card. 
Story (“It’s a lonely scene.” .. . 
“a homey atmosphere”). No mention of feelings 
people in story, but 


ood 


creates a 


or emotions of wondering, 

thinking, and expressions of curiosity and surprise 

may be included 
Social interaction. Also 

personal descriptions of struggles to 

or of one person helping another if 

mention of motivation or feelings. 


In general, stories primarily of description, with 


may be im 
“get along,” 
there is no 


included 


minor affect. 

3. Feelings I. A 
feelings in a superficial, “black-white,” movie-plot 
manner: feelings of pleasantness-unpleasantness, like- 
dislike, happy-sad (“she is heart-broken”), hate, 
anger, love, startle or shock (“She was numb with 
unbelief”), dreaminess, (“He often dreams 
that someday he will be famous”). Teller of story 
may feel strong emotion in a descriptive plot that 
would otherwise fall under two (“It’s a scene of 
horrible destruction”). 

Superficial character descriptions may be included. 
(“He looks like a straightforward, clean-cut, con- 
scientious young fellow.’ “She has a terrible 
temper.” . “She believes herself to be a witch.” 
... “He’s not afraid of much of anything.”) 

Social interaction with emotion expressed in a 
superficial way, without analysis (“This mother 
is telling her child something that is making her 
unhappy”). 

In general, feelings and personalities are men- 
tioned only briefly, with no analysis, and in terms of 


presentation of emotion or 


revery 


cliches. 

4. Feelings Il. More subtle and complex ex- 
pressions. People may be thoughtful, excited, con- 
tented, bewildered, fearful, sulky, self-pitying, re- 
sentful, threatening. (“He is torn.” .. . “His grief 
and self-reproach are so great that.” ... “They're 
two frustrated women living together.” . . . “She's 
too embarrassed or too engrossed in grief and 
self-pity to notice.” “He’s bewildered. He 
doesn’t know how he could have done it.”) 

Social interaction with more complex expression 
of emotion (“An understanding girl tries to con- 
sole him”) 

In general, the feelings are not analyzed, but they 
are less stereotyped than in three, and seem to 
have more meaning to the subject. 

{nalysis 
a very 


or description of character. (“He's 
little boy and have the 
it takes, but the violin 
thing to him.” ‘She doesn’t have the oomph 
it takes to get anywhere.” “She’s always alone 
and enjoys “She’s not unhappy; 
she just wants other things.” “She'd rather 
remain outside in the cold than go to a place 
that isn’t home.”) 

Social and emotional interaction. (“He's a fine 
He brought her out of the depths, and in 
“She’s wishing 
didn’t 
. “She 
wishes 


, : 
serious doesn t 


ambition will mean some 


being alone.” 


man 
return she’s kept him stable.” . 
her daughter could the chance she 
She's a little envious of her too.” 

bl but 


have 
have 
doesn't | 


ame herself for his death 
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‘d been nicer to him.” “They feel they 
really have a lot in common, but they can’t get on. 
He's longing to get away and she’s holding him 
with all her power.” ... “Her father was always 

‘ry easy on her and as a result she didn’t have 
much respect for him.” ... “Her parents are well! 
to do and don’t have much time for her. They 
don’t really talk to her.” . “She probably liked 
to bother her father because she resented his being 
sick, and all the attention people paid him.”) 

Stories include past history and development of 
an emotion; why one emotion is felt and not 
nother; changes that feelings have undergone; 

bivalence. 


Reliability was investigated by having 
another judge rate the stories, using the five- 
interval scale. The product-moment correla- 
tion between judges was 88. The size of 
any r must be evaluated in the light of the 
heterogeneity of the sample used, since it 
varies directly with the variance of the sample. 
It was suspected here that the total variance 
of this sample of stories might well have been 
seriously inflated by the heterogeneity of the 
cards, as indicated by a significantly large 
spread of the card means. In an effort to es- 
timate the importance of this effect upon the 
interjudge reliability an intraclass correlation 
coefficient (9, pp. 203-205) was computed 
from the ratings by one judge of the 600 
stories used in the main part of the study. 
The obtained coefficient was .20. If we con- 
sider r* as an indication of the proportion 
of the total variance accounted for by the 
heterogeneity of the means, we may say 
that (.20)”, or 4 per cent of the total variance 
of the ratings is due to differences in indi- 
vidual card means. 


APPLICATION OF LEVEL oF RESPONSE SCALE 
Procedure 

The continuous line method of rating de- 
scribed above was used with the protocols of 
30 other college women. Half of the sub- 
jects wrote their stories and half took the 
test orally. The ratings on these 600 stories 
were grouped in one-half-centimeter intervals 
to afford more continuous data than would 
the two-centimeter intervals of the scale just 
described. A value of one was assigned to 
the ratings in the lowest interval, and a value 
of 20 to the highest. The midpoint of this 
scale, then, falls between intervals 10 and 11. 

These data were put into a 202 square 
where the 20 rows represented the 20 cards 
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and the two columns the two methods of 
presentation, written and oral. Each sub- 
ject is represented in all rows of the column 
to which she was randomly assigned. The 
data then were subjected to an analysis of 
variance. 


Results 


Table 1 summarizes the results of the 
analysis of variance. The F-ratios for the 
two main variables, cards and presentation, 
and for the interaction term subjects within 


TABLE | 


SUMMARY OF ANALYSIS OF VARIANCE 








MEAN 


Source oF VARIATION df SQUARE 





Cards 19 30.92 

Presentation I 57.04 

Subjects within presentation 28 44.89 

Presentation X Cards 19 4.87 

Error 532 5.07 
Total 599 





*** Means F is significant at .oo1 level of confidence. 


presentation, are all significant at the .oo1 


level of confidence. The first term, cards, 
which refers to individual card differences, 
indicates that the 20 cards differ significantly 
in the level of response elicited. The term 
presentation refers to the written and oral 
test administrations. We find that the written 
stories are significantly lower in level of re- 
sponse. The term subjects within presenta- 
tion refers to the 15 individual subjects within 
each presentation and the F indicates that 
there are consistent individual differences; 
that is, each subject tends to respond charac- 
teristically at about the same level for each 
card. This also is another indication that 
the scale is reliable. The interaction term 
presentation Xcards is not significant; it is 
in fact less than one. This lack of signifi- 
cance indicates that the differences in rank 
order of cards for written and oral presenta- 
tion are not meaningful. In other words, 
the type of presentation did not have a dif- 
ferential effect on the individual cards. 
Table 2 presents the individual card means 
in terms of the 20-point scale, where 1 is the 
lowest level of response, and 20 the highest. 


5°9 
TABLE 2 


Mean Lever or Response Ratincs AND RANKS OF 
Eacu Carp For WRITTEN AND Ora STorigs 
ComBinepD (N= 3) 








MEAN 





16. 


14. 
10 
2 
19 
16. 





The rank order of each card is indicated in 
the third column of the table. A rank of 
one was assigned to card 12F, which has 
the highest mean rating, and a rank of 20 
was assigned to card 11, which has the lowest 
mean rating. The written and oral stories 
are combined for purposes of easier com- 
parison. This procedure is justified by the 
fact that the interaction term presentation X 
cards is not significant, and therefore the 
rank order of the cards is not affected by the 
type of presentation. 


Discussion AND CONCLUSIONS 

Derivation of the scale. The method by 
which the scale was derived has two ad- 
vantages which are rarely found in rating 
scales with this type of material: (@) the 
original ratings are continuous, so that the 
data can be analyzed by statistical methods 
assuming continuous measures; (4) the scale 
can be subdivided into any convenient num- 
ber of categories which can then be described 
in terms of the actual words used by the 
subjects. Thus the scale criteria are derived 
from the original data rather than from a 
priori definitions which might be found later 
to be inapplicable to the material. The op- 
timal number of categories depends upon the 
situation in which the scale is to be used. 
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The five-category subdivision is designed 
for use in the clinical situation. A greater 
number of intervals would probably increase 
the reliability of the scale, but also would 
make it more cumbersome to use. The 20 
subdivisions, as previously stated, were made 
for purposes of continuity. The method is 
useful, then, both for applied and for re- 
search purposes, and may profitably be used 
to investigate factors other than the one de- 
scribed here. 

Cards. Although the spread of card means 
is significantly large, when they are arranged 
in order of mean magnitude, no two ad- 
jacent card means differ significantly from 
each other. For this reason only the cards 
at the extremes will be discussed. 

Cards 12 and 18 have the highest average 
levels of response, 12 and 11.9, respectively, 
falling on the borderline between categories 
3 and 4. The stories for these cards often 
deal with violent emotions; those for card 
12 frequently mention evil influence, and 
the themes for card 18 generally involve death 
and murder. 

Cards 11 and 19 are particularly low in the 
average level of response elicited, with aver- 


ages of 8.4 and 8.5, respectively, placing them 
in the second category of the five-point scale, 
which is characterized by plots with no men- 


tion of feeling or emotion. These results 
tend to bear out Rotter’s assertion (8) that 
TAT pictures with no large central figures 
produce less personally meaningful material. 
In this group of subjects card 11 often elicits 
descriptions of mountain scenes, and card 19 
of snow scenes. However, since these two 
cards do occasionally produce stories of ag- 
gression and violence, it seems likely that 
their particular usefulness is with subjects 
who cannot express hostility in everyday situ- 
ations that are too close to them, but can 
express it in stories about dragons and violent 
storms. 

Presentation. The mean for all the written 
stories is 10.1, and that for the oral stories 
is 10.8. Both means fall within the third 
category of the five-point scale. This is the 
level where personal feelings are -first men- 
tioned. Inspection of the protocols suggests 
that the subjects who wrote their stories did 
not become as involved in the situation as 
did the subjects who told their stories. The 


Dorotny ‘TERRY 


written stories are rather obviously patterned 
after women’s magazine literature, and do 
not seem to reflect spontaneous, personal feel- 
ings and attitudes. Some support for this 
idea may be found in Eron and Ritter’s re- 
cent article(2). These authors find fewer 
alternate themes and more humorous themes 
in their group of written stories than in the 
oral stories. The process of writing may 
make for less spontaneity; a subject telling 
his stories may be likely to say the first 
thought that comes into his head, whereas 
he might edit and change that thought when 
confronted with the task of putting it in 
writing. 

Administering the TAT in written form to 
groups of subjects is economical, making 
feasible large normative studies which would 
otherwise involve an enormous amount of 
time and effort. The urgent need for such 
studies of the TAT may seem to outweigh 
the disadvantage that written and oral pro- 
tocols apparently are not strictly comparaole. 
On the other hand, the possibility of obtain- 
ing more superficial, stereotyped material in 
the written situation may lead us to under- 
rate the true clinical value of the TAT. Cer- 
tainly both types of administration can give 
us important information, but we must be 
careful not to generalize too freely from one 
situation to the other. 


SUMMARY 


If we assume that the projective value of 
the TAT is a function of the subject’s degree 
of involvement, then in order to investigate 
this relationship a method of quantifying the 
level of response is required. The degree 
of involvement, or level of response, was 
measured by an empirically derived rating 
scale. Scale reliability, as indicated by the 
correlation between two judges’ ratings, was 
88. The scale was used to investigate dif- 
ferences in level of response to the individual 
TAT cards, to written and oral test adminis- 
tration, and among the individual subjects. 

Forty women sophomore students in the 
elementary psychology course at the Uni- 
versity of Wisconsin were given the 20 cards 
of the women’s series. Half of the subjects 
wrote their stories, and half gave them orally. 
The 200 stories given by 10 of the subjects 
(5 written and 5 oral) were used in the pre- 
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liminary reliability study. The 600 stories 
given by the remaining 30 subjects were 
used in the main study. 

Each of the 800 stories was rated on a 
separate 10-centimeter line with no subdivi- 
sions, and with extremes ranging from a pure 
description of the picture, to detailed dis- 


cussions of the motivation and analyses of 


the feelings of the story characters. A five- 
category composite scale was made, where 
each scale interval was described in terms of 
quotations from the stories within the in- 
terval. This scale was derived for use in 
the clinical situation. For purposes of further 
analysis, a more finely subdivided scale with 
20 intervals was used with the continuous 
ratings for 600 of the stories. These data 
were subjected to an analysis of variance. 
The cards show significant differences in 
average level of response elicited. These dif- 
ferences are in accord with Rotter’s criteria 
for TAT pictures. The written stories show 
a significantly lower average level of re- 
sponse than do the oral stories, suggesting 
that the subjects writing their stories become 
less involved in the test, and thus produce 
less personal material. The subjects demon- 
strate consistent individual differences in level 


of response, a further indication of the relia- 
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bility of the story material. It is suggested 
that the procedure used in deriving the scale 
may profitably be applied to other investiga- 
tions attempting to quantify TAT produc- 
tions. 
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THE INFLUENCE OF VARYING DEGREES OF PSYCHOLOGICAL 
STRESS ON PROBLEM-SOLVING RIGIDITY * 


EMORY L. COWEN 


University of Rochester 


NDERSTANDING psychological stress is im- 
portant both for personality theory 
and for various forms of psychological 

practice including that of the educator, the 
psychotherapist, the physician, the social 
worker, the guidance counselor, and perhaps 
even the diplomat. It is a point of general 
agreement that to a greater or lesser extent 
there will be some alteration of behavior as 
a result of psychological stress. This gen- 
eralization applies to various forms of psy- 
chopathology (1), including neurosis (11, 18), 
and organic brain injury (9). 

In the present investigation we are primar- 
ily concerned with one highly specific aspect 
of the influence of stress on behavior. This 
concerns persistence, in an induced method of 
problem solution which no longer offers the 
most direct and efficient method of solving 
the problem under consideration, under vary- 
ing degrees of psychological stress. Such 
perseverative behavior, involving the inabil- 
ity to change one’s set, has generally been 
subsumed under the concept of “rigidity.” 

Earlier (5) it was noted that the term 
rigidity has been used diversely to describe a 
wide range of behaviors. Frequent absence 
of operational clarity in defining the con- 
cept has resulted in considerable misunder- 
standing, since workers have attempted to 
describe with the same label, behaviors which 
appear to be quite different qualitatively. 
Present usage of the term rigidity does not 
in any way imply a summarization or factori- 
zation of the multifarious use of the concept 
in the current literature. Rather, it is pro- 


posed in a specific sense, as an operational 
definition stemming directly from the be- 
haviors tapped by the criterion test used in 
the appraisal of the effects of psychological 


stress. 
It is equally important to clarify the usage 


1 The present paper is based in part on a dissertation 
submitted in partial fulfillment of the requirements for 
the Degree of Doctor of Philosophy, to the Graduate 
School of Syracuse University, in August 1950. 
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of the concept of stress, since this is another 
nondenotable variable which has been used 
quite differently to describe many situations. 
Stress may imply a mode of perception of a 
given stimulus situation. Consequently any 
constant situation may be perceived as dif- 
ferentially stressful for different individuals 
(owing to differences in personality structure, 
ego-involvement, etc.). Glixman (8) calls at- 
tention to two assumptions that anderlie any 
attempt to set up a stressful experimental 
atmosphere. 

1. The experimenter’s attempt to produce a 
threat to the subject’s self-esteem is, for most sub- 
jects, successful. 

2. The order of magnitude of stress as the ex- 
perimenter presents it represents the degree of threat 
to subjects. 


In the present study the final operational 
definition of stress is in terms of exposure 
to several controlled experimental conditions 
designed to induce perception of varying de- 
grees of psychological stress. In order to 
determine whether or not there were sys- 
tematic group differences in the perception 
of the stress situations, rating scales have been 
used to measure attitudes toward each of 
the experimentally induced atmospheres. 

The present condensed report does not at- 
tempt to review systematically the literature 
relevant to factors underlying and associated 
with psychological rigidity. A more detailed 
treatment of this phase of the problem is 
available elsewhere (4). Suffice it to note here 
that various studies at the human level (2, 2, 
10, 12, 13, 16, 20, 21) have used such diverse 
stress stimuli as electric shock, fear of failure 
on a test, frustration, sudden cold showers, 
time pressures, and anxiety resulting from the 
interpretation of a Rorschach, to bring about 
such rigid-like responses as perseveration, 
stereotyped behavior, loss of abstract ability, 
and delayed time of response to a problem re- 
quiring a new task approach. Two of these 
investigations (16, 20) have examined the in- 
fluence of time pressures on the specific type 
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of problem-solving rigidity being studied in 
the present experiment. 

In terms of the present operational defini- 
tions of stress and rigidity, the experimental 
hypothesis may be stated as follows: Increas- 
ingly stressful psychological atmospheres will 
tend to elicit increasingly rigid problem-solv- 
ing behaviors. 

METHOD 

The instrument used to measure rigidity 
as defined was the Luchins’ water-jar pro- 
cedure * (16). On the basis of a pilot study 
it was decided that only two set-building 
problems would be used. Thus it was hoped 
that a relatively weak set might be estab- 
lished, so that persistence in the use of this set 
when it was no longer appropriate might be 
related more readily to the stressful experi- 
mental atmosphere. The pilot study also sug- 
gested the use of a 2.5-minute time limit for 
each problem. The final test series of water- 
jar problems is presented in Table 1. 

Three groups of 25 Ss each made up the 
sample for that part of the experiment re- 
ported in the present paper. All Ss were 
college juniors and seniors, volunteers, who 
were drawn from classes in psychology, soci- 
ology, and education. No attempt was made 
to match Ss in the three groups; rather, the 
technique of random assignment to groups 
was employed. 

The independent variable in this investiga- 
tion was the amount of antecedent psycho- 
logical stress. One of the groups was a non- 
stress or control (C) group. The second was 
a mild stress (MS) group, while the third 
was a strong stress (SS) group. 

Selection procedures for the control and 
mild stress group were identical. The experi- 
menter (E) read a notice to several classes 
requesting volunteers for psychological re- 
search. Those who signed up were alternately 
assigned to the control and mild stress groups. 

For all Ss in these groups, the testing at- 
mosphere was an informal and friendly one. 
In the control group, E proceeded uninter- 
ruptedly through two practice problems and 
the test series. However, in the mild stress 
group, there was one important change in the 

2 Another measure of this same behavior (ic, an 
original alphabet maze) was also used in an exploratory 
way. Results with this instrument, while basically incon- 


clusive, were in the predicted direction. A fuller report 
of these data is presented elsewhere (4). 


TABLE 1 


Test Series oF WATER-JAR ProsLeMs 








Size oF Jars QUANTITY To 
AVAILABLE Be OBTAINED 


Zz 
+) 


Type or Prop_Lem 





1st Practice 10 3 7 
2nd Practice 4 13 
3 jar control 74 7 
ist Einstellung 71 19 
2nd Einstellung 81 17 
1st Crucial 51 12 
2nd Crucial 79 17 
3rd Crucial 90 21 
4th Crucial 65 
3rd Einstellung 57 
Exunction 65 
sth Crucial 75 
6th Crucial 49 
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° 


- 
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procedure. After completing the practice 
problems, and the first (control) problem of 
the test series, E introduced an unrelated 
problem, designed to be mildly stressful. This 
problem, which is presented in Fig. 1 was 
selected to meet two criteria: 

1. It should be appealing enough to motivate 
Ss toward a solution, and give them the feeling that 
it could be solved. 

2. Though technically soluble on the basis of the 
instructions given, practically speaking it was not, 
thereby presumably establishing a lack of closure in 
S. 


S was then given five minutes to solve this 
problem,® after which the remainder of 
the water-jar problems were administered. 

The selection of Ss for the strong stress 
group followed a somewhat different pro- 
cedure, involving a preliminary administra- 
tion of a group projective device, the Levy 
Movement cards (24). The Ss were told that 
they were part of a standardization group for 
a new projective device. The E announced 
that Levy Movement protocols were to be 
evaluated by a board of clinicians, and that 
those people whose records were question- 
able would probably be called back for addi- 
tional testing. 

Twenty-five people selected at random from 
a larger group of 108 volunteers, were noti- 
fied about one week later that it would be 
necessary for them to be tested further. These 
people who made up the strong stress group 
were given the criterion test of problem- 
solving rigidity anywhere from three to ten 
days following receipt of the notice for fur- 
ther testing. 

Contrasted to the casual and informal test- 
ing environment set up for the control and 
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mild stress group, the prevailing atmosphere 
for the strong stress group was cold, informal, 
and businesslike. Attempts at conversation 
by the Ss in this group were brusquely cut off 
by E, who proceeded directly to the water- 
jar test. Before beginning the administration 
of the test, E read a prepared statement to S, 
saying that evaluation of S’s projective test 
protocol had pointed to the presence of cer- 
tain maladaptive personality features, and 
that the purpose of the testing about to take 
place was to provide further data to test the 


accuracy of the diagnostic formulation. After 
reading this statement, the practice problems 
and test series of water-jar problems were 
administered in exactly the same way as was 
done in the control group. 

At the end of the experiment each S filled 
out a series of 7-point rating scales, based on 


attitudes toward the tests taken. When this 
was done all Ss were given a complete expla- 
nation of the nature of the experiment. The 
Ss in the strong stress group were advised 
that their selection had in no way been based 
on their projective test records, which, they 
were informed, had not even been scored. 
Enough time was spent with each individual 
to insure that he was aware of the random 
basis of his selection, and was satisfied that 
the present experiment in no way provided 
a basis for the appraisal of his personality or 
adjustment. 


RESULTS 


The test series of water-jar problems, listed 
in Table 1, consists of eleven problems. The 


8 Two Ss solved the puzzle within the time limit. 
These Ss were dropped from the study. 


Instructions: 


On the left there are nine dots. The object of this task 
is for you to draw four straight lines which will pass 
through all 9 dots. The lines may intersect. 


The only rules which must be observed are: (a) no 
more than four lines may be used, (4) all lines must 
be straight, (c) all dots must be passed through, (d) once 
you have started you may not lift your pencil from the 
paper. 


This is not a “trick” puzzle—it can be solved if you 
think clearly. You will be allowed five minutes to work 
it out. 


Puzzite Usep to EstasiisH Micp Stress 


first of these is a control problem which may 
be solved either directly (A-C) or indirectly 
(B-A-C-C). Pzesumably without previous ex- 
posure to a given Einstellung, subjects will 
tend to use a direct solution in this problem. 

The next two problems are set-builders 
which may be solved only by the indirect so- 
lution (B-A-C-C). Four crucial problems 
follow; these are problems which can be 
solved either by a direct or indirect solution. 
After this there is one additional set-builder, 
followed by an extinction problem. The ex- 
tinction problem requires an entirely new 
solution (A-++-C). Finally there are two more 
crucial problems which may be solved either 
directly or indirectly. 

Since problem-solving rigidity is defined 
here as the tendency to adhere to an induced 
method of problem solution when it ceases 
to be the most direct one, the primary meas- 
ure of this variable consists of the number 
of crucial problems which are solved by an 
indirect rather than a direct solution. In the 
present test series, it was possible to obtain a 
maximal rigidity score of seven (based on 
six crucials and one extinction problem). 

Certain secondary and somewhat overlap- 
ping measures of rigidity are also available. 
These include slower average time of response 
to the test series, failure to solve the extinc- 
tion problem and/or delayed time of response 
to this problem. 

A first step in the analysis of these data 
has been the simultaneous examination of 
differences in means between the three ex- 
perimental groups, by means of an analysis 
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TABLE 2 


Comparison oF Numser oF Ricip SoLuTIONs To THE WaATER-JAR ProBLeMs 


(Groups C, MS, and 


SS; N=25 each)* 








Drier. Ber. 


CoMPARISON 


MEANS 


t-Ratio P VARIANCE 





C vs. MS 
C vs. SS 
MS vs. SS 


Homogencous 
Homogeneous 
Homogencous 


2.038 
7.680 
4-190 


-05 
-OoI 
-OOI 





* Li = .9415 indicates over-all homogeneous variance; analysis of variance F-ratio = 23.04, significant at 


beyond the .oo1 level. 


of variance technique. Since this technique 
presumes approximately homogeneous vari- 
ance amongst the groups being compared, as 
well as random assignment of Ss to groups, 
it was necessary in each case to precede it by 
the Welch-Nayer (L1) test for homogeneity 
of variance (19, 23). Where the results of 
this test indicated approximately homoge- 
neous variance it was considered justifiable to 
proceed with the analysis of variance. In those 
cases where the analysis of variance yielded 
a statistically significant F-ratio, it was con- 
sidered defensible to compare individual 


group means using Fisher’s t-test. Since this 
test also presumes approximate homogeneity 


of variance between the two groups being 
compared, it was necessary in all cases to pre- 
cede it with an F-test for homogeneity of 
variance between the two groups being con- 
trasted. 

This general technique was used to study 
the primary operational measure of rigidity, 
the number of indirect solutions to the crucial 
water-jar problems. The results of this com- 
parison are summarized in Table 2. Here, 
the use of the Lr test demonstrates that there 
is approximately homogeneous variance 
among groups. The obtained highly sig- 
nificant variance ratio indicates that there is a 


statistically reliable over-all difference between 
group means. Using the .05 confidence level 
as an arbitrary cutting point, subsequent 
comparisons of individual group means dem- 
onstrate that the control group has performed 
significantly less rigidly than either the mild 
or strong stress groups on this primary meas- 
ure of problem-solving rigidity. Likewise, 
the mild stress group is significantly less rigid 
than the strong stress group. 

Turning now to the secondary measures of 
rigidity, Table 3 presents the results of the 
comparisons of the three groups with respect 
to average time of response to the water-jar 
problems. Once again, approximately ho- 
mogenous variance is demonstrated by the L1 
test, and a significant over-all difference in 
group means by the analysis of variance. Ap- 
plying Fisher’s ¢ for comparison of individ- 
ual group means it is observed that the con- 
trol group demonstrates a significantly more 
rapid average time of response to the water- 
jar problems than does either the mild stress 
or strong stress group, and that the mild 
stress group, too, responds more rapidly to 
these problems than does the strong stress 
group. 

The last rigidity measure for which this 
general type of statistical analysis was made 


TABLE 3 


Comparison OF AverAGE Time oF REsPONSE TO THE WaATER-JAR PrRoBLEMS 


(Groups C, MS, and 


SS; N=25 each)* 











Group 


Group MEAN CoMPARISON 


Drer. Ber. 
MEANS 


t-RaTio Pp VARIANCE 





21.28” C vs. MS 
30.36" C vs. SS 
40.12” MS vs. SS 


9.08 
18.84 


Cc 
MS 
SS 


9-76 


Homogencous 
Homogeneous 
Homogeneous 


2.564 
4.868 
2.259 


+02 
-OO!1 
+03 





® L1 = .9346 indicates over-all homogeneous variance; analysis of variance 


beyond the .oo1 level. 


F-ratio = 11.535, significant at 
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was for time of response to the extinction 
problem (i.¢., resistance to the extinction of 
an inappropriate set as a secondary measure 
of problem-solving rigidity). The results of 
this comparison are summarized in Table 4. 
Again we find a nonsignificant Li and a sig- 
nificant variance ratio, F. The comparison of 
individual group means* shows that both 
the control group and mild stress group re- 
sponded significantly more rapidly to the ex- 
tinction problem than did the strong stress 


group. Moreover the control group also re- 


sponded significantly more quickly to this 
problem than did the mild stress group. 

Chi square was used to compare the groups 
with respect to frequency of nonsolutions of 
the extinction problem. In the control group 
there was only one nonsolution while the cor- 
responding figures in the mild stress and 
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stress groups indicates that while the group 
means fall in the expected direction, the dif- 
ferences are small enough to be accounted 
for on the basis of chance alone. 


Discussion 

Statistical analysis reveals that under in- 
creasingly stressful psychological conditions 
there is a greater tendency to adhere to an 
induced behavior which has become inappro- 
priate. It now remains to inquire somewhat 
more deeply into the possible underlying psy- 
chodynamic factors capable of throwing ad- 
ditional light on this finding. 

A first clue may be provided by the attitude 
scales which demonstrate that more negative 
attitudes appear to be expressed toward situ- 
ations of more intense psychological stress. 
Such attitudes may generally be viewed as 


TABLE 4 


CoMPARISON OF AVERAGE Time oF RESPONSE To THE WATER-JAR EXTINCTION ProBLEM 
(Groups C, MS, and SS; N=25 each)* 








Drrer. Ber. 
MEANS 


Group 


CoMPARISON t-RaTio p VARIANCE 


Group 





Homogeneous 
Homogeneous 
Homogencous 


2.707 .O1 
7.168 .0o1 
3.219 -O1 


C vs. MS 
> uvs. SS 
MS vs. SS 


37-92 
86.88 
48.96 





9325 indicates over-all 
} ] 


I ievel. 


°ls = 
be yond the of 


strong stress groups were seven and thirteen. 
Analysis of this distribution yields a total chi 
square of 12.77, which for two degrees of 
freedom is significant between the .or and 
oor confidence levels. 

One other group of results shouid be briefly 
noted. An analysis of responses to the vari- 
ous attitude rating scales, given after the 
rigidity testing had been completed, was car- 
ried out. A simple analysis of variance here 
indicates that there are over-all differences in 
group means with respect to favorableness 
of attitude toward the rigidity testing situa- 
tion. Based on a seven-point rating scale, 
both the control and mild stress groups had 
significantly more favorable attitudes toward 
the test situation than did the strong stress 
group. Comparison of the control and mild 

4Where no solution of the extinction problem was 
achieved, 2.5 minutes was used as the time of response 


in calculating the group means. 


homogeneous variance; 


analysis of variance F-ratio = 19.823, significant at 


reflections of the manner in which the ex- 
perimental groups have perceived a given sit- 
uation. Thus in the present study there ap- 
pears to be an over-all relationship between 
order of magnitude of stress as presented by 
E and perception of stress by S. It is hoped 
that through a fuller understanding of the 
psychological impact of these situations upon 
a person, we may arrive at a sounder basis 
for understanding the manifest differences in 
behavioral rigidity. 

In the control situation no attempt is made 
to impose any stress whatsoever on S.° On 
the contrary, E seeks to establish a basically 
friendly and permissive atmosphere, and to 
help S feel that he is cooperating in a research 
project as one amongst many, rather than 


5 However even in this situation there may be implicit 
threat present for certain subjects by the very nature of 
the fact that they are taking “psychological tests” and/or 
being timed. 





PsycHOLoGIcAL STRESS AND PRroBLEM-SoLvING RIGIDITY 


being examined personally. Thus we may 
regard the control condition as a good learn- 
ing situation, one in which the S’s energies 
may be directed toward a relatively free ex- 
amination of the problems which are posed 
to him. He is able to form broad compre- 
hensive cognitive maps (22) which later allow 
him to shift readily to a more parsimonious 
problem-solving behavior when the external 
situation makes such a path to the goal 
available. 

In the mild stress situation, E initiated 
pressures are again minimal, and an attempt 
is made to put S at ease. However, after 
completion of the practice and control prob- 
lems of the test series, S is exposed to a periph- 
eral stress experience when he is given a 
puzzle which he is unable to solve. The stress- 
ful impact of this experience doubtless varies 
considerably from person to person. In gen- 
eral, however, it is believed that it may serve 
to interrupt the heretofore consistent posi- 
tiveness of the testing situation. S has been 
unable to deal with a problem which he feels 
he should have been able to handle. This 


may tend to make him sormewhat defensive, 
and no longer able to feel completely free in 


his exploration of the test situation. Now 
more directly motivated to succeed in subse- 
quent tasks, he is likely to be predisposed to 
somewhat greater adherence to previously 
utilized successful problem-solving behaviors. 
Hence we observe in the mild stress condi- 
tion, more rigidity than was present in the 
control situation. 

From the psychological point of view, the 
strong stress situation appears to differ rather 
markedly from either the control or mild 
stress situations. A more thoroughgoing ex- 
amination of the impact of this situation may 
help to explain the extensive rigidity of be- 
havior manifested by this latter group. 

These Ss entered the criterion testing situa- 
tion after having been exposed to a series of 
consistently negative stimuli. First, they were 
told that their personalities were to be ex- 
amined by a psychologist, probably a some- 
what threatening prospect per se. Next they 
were informed that some of them who ap- 
peared to show maladaptive personality fea- 
tures would be called back for further test- 
ing. This may be regarded as potentially 
quite disturbing and relevant to more central 
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areas of personality. Several days later, Ss 
who were to make up this group were noti- 
fied that they were to come back for addi- 
tional testing. This, in terms of the original 
structure of the situation, was equivalent to 
saying to § that based on psychological test 
data there is evidence that he is maladjusted. 

Clearly then, up to the time of the final 
testing situation, S had been exposed to a 
consistent series of insecurity breeding stim- 
uli. Moreover, each S entered the final testing 
situation faced with the immediate pros- 
pect of actually being told that he was mal- 
adjusted. In this situation, most Ss sought 
to initiate friendly conversation with the ex- 
perimenter, perhaps hoping to find something 
positive in the situation to counterbal- 
ance the overwhelmingly negative experi- 
ences to that point. However, E brusquely 
cut off such conversation, thereby adding one 
more source of discomfort to an already 
strongly threatening situation. E then read 
the supposed “evaluation” of S’s personality, 
indicating the presence of possible maladap- 
tive features. This was calculated to be the 
culmination of a series of threats, leaving S 
isolated and friendless, in an apparently hos- 
tile situation. 

In Goldstein’s terminology (9), we may say 
that S in the strong stress group had been 
confronted with a miniature “catastrophic” 
situation, in terms of a series of stresses with 
which he has been unable to cope. He has 
come to feel grossly inadequate, helpless, and 
psychologically insecure in this situation. 

We may regard the behavior of S in the 
strong stress group as characterized by a 
desperate searching for a mode of response 
which could, at least minimally, meet the 
demands being placed upon him. The early 
set problems in the test series provide such 
a “pseudo-security.” The S, eagerly seeking 
out some ego-redeeming response in an alien 
atmosphere, is likely to notice that a certain 
“formula” (B-A-C-C) may legitimately be 
used in the solution of the set problem. Such 
a discovery is of great importance to him. 
He has at last been able to do something 
right in the situation—to find a source of 
support in his search for security. 

When confronted with problems soluble 
by either this magic formula or by another 
more direct method, it is logical to anticipate 
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that S’s in this group would tend to turn 
immediately to the single response which has 
proved to be adequate in this situation, their 
sole hope to that point for salvaging some 
feelings of self-esteem. Clearly these Ss are 
not free to explore since each of their past 
behaviors with one exception (the Einstel- 
lung solution) has been inadequate. It is 
natural that they should cling to the single 
behavior which has succeeded in the past. 
Thus, it can be hypothesized, does the rigid 
problem-solving behavior of the strong stress 
group come about. 

A basic finding of the present research is 
that problem-solving rigidity increases under 
increasing degrees of psychological stress. At 
the outset, emphasis was placed on the specific 
definitions of the concepts of stress and 
rigidity being used in this investigation. It 
is at the same time important to be able to 
generalize somewhat beyond the present 
limited uses of these concepts. With this in 
mind, it should be noted that workers in 
this area have regarded the basic processes 
involved what we have called problem-solv- 
ing rigidity as constituting a paradigm of 
maladaptive behavior. Illustratively, Klee 
says, “behavior is abnormal when it persists 
in a situation where it is no longer relevant” 
(14, p. 41). 

Stress has been used as a means of eliciting 
certain desired behaviors in situations as 
varied as educational practice, psychotherapy, 
and international politics. That we should 
more carefully examine the possible dangers 
of such techniques has been suggested by 
other workers (7, 15, 17), and is clearly im- 
plied in the findings of the present research. 
A fuller discussion of this problem is avail- 
able elsewhere (4). Suffice it to emphasize 
here the importance of the relationship 
between rigidity, hostility, authoritarianism, 
and intolerance on the one hand, and be- 
tween love, understanding, and self-accept- 
ance on the other, which Frenkel-Brunswik 
points out. In her words “the struggle 
between these two orientations is basic to our 
civilization” (6, p. 141). 


SUMMARY 


The present study is an investigation of the 
hypothesis that increasing degrees of experi- 
mentally induced psychological stress will 
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elicit increasing amounts of problem-solving 
rigidity. Rigidity is defined as the tendency 
to adhere to an induced method of problem- 
solving behavior when the induced solution 
no longer represents the most direct and eco- 
nomical path to the goal. Stress is defined 
in terms of three operationally stated experi- 
mental atmospheres, calculated to bring 
about stress-perception of varying degrees of 
magnitude. Increasingly negative attitudes 
toward the test situation were expressed by 
subjects in increasingly stressful experimental 
atmospheres. Thus, the assumption of real 
differences between groups in the perception 
of the three experimental climates appears 
tenable. 

Seventy-five Ss took part in this experiment, 
25 in each of three groups—control, mild 
stress, and strong stress. All Ss took a water- 
jar test of problem-solving rigidity, on the 
basis of which four quantifiable rigidity 
indices were computed. Analysis of variance 
indicated that there were significant over-all 
differences between groups on the primary 
measure of problem-solving rigidity (total 
number of rigid solutions) as well as two 
secondary measures of this variable (average 
time of response, and average time of re- 
sponse to the extinction problem). On all 
three of these measures, comparison of the 
individual group means demonstrated that 
the control group was significantly less rigid 
than either the mild or strong stress groups. 
Moreover, in each instance the mild stress 
group was significantly less rigid than the 
strong stress group. Another secondary 
measure, frequency of nonsolution of the 
extinction problem, when studied by chi- 
square analysis also yielded statistically sig- 
nificant results in the hypothesized direction. 

It is concluded that the initial hypothesis 
appears to be a tenable one. In terms of the 
definition of variables set up in this study, 
increasing stress does in fact elicit increas- 
ingly rigid problem-solving behaviors. Some. 
factors perhaps capable of explaining these 
findings have been considered, and certain 
implications of the results have been 
suggested. 
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EXPECTANCY OF PUNISHMENT AS A FUNCTION OF TYPE 
OF DIFFERENTIATION IN ORIGINAL LEARNING * 


LEON LEVY 


The Ohio State University 


HE problem investigated by the present 

study was suggested by the clinical- 

social hypothesis that punishment 
which occurs in an undifferentiated situation 
has more effect on later anticipation of 
punishment than does punishment which 
occurs in a differentiated situation. 

Such an hypothesis offers the possibility of 
relatively rigorous test in terms both of 
learning theory and more pragmatic clinical 
hypotheses. Research relating experimentally 
based learning theory and perhaps less 
sophisticated but socially more immediately 
practical clinical notions seems to the authors 
to be maximally useful in the field of clinical 
psychology today. It is, of course, true that 


translation of the operations or definitions 
for learning concepts from those suitable for 
animals to analogues suitable for human 
beings is necessary for much of learning 


theory. 

An experimental test of this hypothesis re- 
quires the construing of the concepts “punish- 
ment,” “undifferentiated,” “differentiated,” 
“far-reaching effects,” and “anticipation” in 
such a manner that they are operationally 
defined and still “capture” as much of the 
connotation of these clinical-social concepts 
as is experimentally feasible. The authors 
recognize, of course, that such construing of 
concepts does not make the experimental 
and the clinical-social hypotheses identical in 
meaning. 

The authors believe that this procedure 
of construing clinical-social concepts, and 
thereby the hypotheses containing them, in 
rigorous operational form is one which is 
doubly fruitful. It provides the kind of in- 
formation which is most likely to be useful 
in clinical practice and also provides the type 
of knowledge which lends itself to more 
systematic use, especially to use in current 
learning theories. 

1 The authors wish to thank Dr. Charles Spiker of the 


Iowa Child Welfare Research Station for his critical and 
constructive reading of this paper and his suggestions. 
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lowa Child Welfare Research Station 


At the “practical level” the behest, “give 
the child techniques for solving his prob- 
lems” is an example of the implicit accept- 
ance of the hypothesis stated in the first para- 
graph. Explicit in psychoanalytic theory and 
implicit in the personality theory and thera- 
peutic practice of other systems is the notion 
that early childhood traumas are somehow 
deeper and more lasting as far as their effects 
on later learning are concerned, than are 
traumas occurring to the adult and older 
child, who have made more complete differ- 
entiation of their common life situations. It 
might be said that “less stimulus generaliza- 
tion,” using the term analogically, occurs in 
the Istrer case. 

Familiar examples from life may be fur- 
nished by the illustration of the small child 
(presumably having differentiated his en- 
vironment very incompletely) who, fright- 
ened by a dog, fears all dogs; the adult, 
similarly frightened, simply walks around 
the block to avoid the particularly hostile dog. 

It also appears to us that this clinical-social 
hypothesis, with its concepts defined accord- 
ing to the requirements set forth above, is 
one which is relevant to, although, strictly, 
not a test of, certain issues in current learning 
theories, particularly, the non-continuity-con- 
tinuity controversy (1); and the stimulus gen- 
eralization-differentiation or cognitive theory 
controversy. 

On the basis of this reasoning, the experi- 
mental hypothesis was set up: 

When negative reinforcement occurs in an inade- 
quately differentiated situation,? there will be more 
frequent verbal predictions by the subjects of nega- 
tive reinforcement in a future similar situation than 
there will be when negative reinforcement occurs in 
an adequately differeritiated situation.2 There will 
also be less accuracy in the subjects’ predictions of 
negative reinforcement, and they will verbalize less 


certainty about the prediction in general when the 
negative reinforcement occurs in the inadequately 


2 Operational definitions for “adequately” and “inade- 
quately differentiated situations” are given in the section 
describing experimental procedure. 
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differentiated than in the adequately differentiated 
situation. 


The negative reinforcer used in the present 
study was mild shock to the left forearm of 
the subject (S). Generally speaking, it is 
believed that situations which are differenti- 
ated for a given S (or group of Ss), as defined 
in the experimental procedure, will also be 
related to a class of verbalizations by the S, 
which class may be roughly described as 
containing references to S’s structuring or 
arranging of his awareness so as to provide 
a solution to the problem. The term “ex- 
pectancy” will be used to refer to “the verbal 
or otherwise signified predictions.” 


METHOD 
Subjects 


Forty-five elementary psychology students 
at the Ohio State University,’ of whom ap- 
proximately two-thirds were women, served 
as Ss. Other than assuring equal proportions 
of men and women in each of the experi- 
mental groups, the Ss were randomly 
assigned to experimental groups 1 to 3. 
Subsequent to the experiment, percentile 
standings on the Ohio State Psychological 
Examination were obtained for the Ss (one 
subject in both Groups 1 and 2, and two in 
Group 3 had no recorded test result). 
Table 1 gives quartile distributions of the Ss. 
Chi square, done by combining the upper 


TABLE | 


DistrisuTION oF SuByEcT GROUPINGS ON THE OHIO 
Sratre PsycHoLocica, EXAMINATION 








Group 2 Group 3 


Group 1 


PERCENTILE 





76-100 

51-75 

26-50 
0-25 





two quartiles into an “above the median,” 
and the lower two quartiles into a “below 
the median” group (to insure sufficiently 
large theoretical frequencies), results in a 


8 The authors wish to express appreciation to Dr. Delos 
D. Wickens for permission to use members of elementary 
psychology classes under his direction as experimental 
subjects. These students were required as a part of 
their coursework to participate in a certain number of 
experiments. 
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figure of 1.51, indicating (for two degrees of 
freedom) that differences of this order would 
occur by chance in between 30 and 50 per cent 
of such samples. Chi square, computed less 
legitimately for the table as it stands (there 
are insufficiently large theoretical frequencies 
in some of the cells) and using correction 
for continuity (3), results in the figure of 3.37, 
significant for six degrees of freedom at be- 
tween the 70 and 80 per cent level. Pre- 
sumably, then, Ohio State Psychological 
Examination quartile is unrelated to the 
experimental results. 


Procedure 


Differential Training Phase (DT). Dur- 
ing this phase, S was given the deck of 22 
cards and told that his task was to sort the 
cards into four different groups. This phase 
is considered to provide or permit adequate 
differentiation of the situation in the training 
phase (Tr). 

Each S was run through the card sorting 
once. Then he was questioned as to the 
principles involved. If he was unable to 
verbalize them immediately, some hints were 
given until he was finally able to state them 
all correctly. S was then run through the 
sorting test once again, as a check and to 
reinforce the concepts which he had de- 
veloped. S had to learn to sort the cards 
by color, number, form and miscellaneous 
categories. 

Training Phase (Tr). This phase fur- 
nishes the operation for “inadequate differ- 
entiation.” During this phase, S was seated 
in front of the exposure apparatus with 
electrodes fastened to his left forearm. The 
verbatim instructions given S were as follows: 


During this part of the experiment you will see 
cards similar to these (experimenter shows S a few). 
You will see them in the mirror in front of you for 
a period of five seconds each. At the end of five 
seconds the light inside the box will go off and you 
will no longer be able to see the card. When cer- 
tain cards are presented, at the end of the five 
seconds you will receive a mild shock to your left 
forearm; with the rest, nothing will happen. Your 
job during this part of the experiment is to try to 
discover the principles determining when the shock 
will occur. Try to figure out the rule so that you 
can predict when a shock will occur. You will have 
an opportunity in the last part of the experiment 
to demonstrate how much you have learned during 
this part. 
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There was approximately a three-second 
interval between exposures. When shock 
was presented, it occurred during the last 
half second of the exposure period and the 
light and shock terminated together. The 
principle which S was to learn was that mis- 
cellaneous cards resulted in shock and that 
all others did not. At the end of this phase, 
the subjects were questioned as to what they 
thought the principle was. Replies were 
recorded but S was told that the examiner 
couldn’t tell him whether or not he was right. 

Test Phase (Te). S was seated in front of 
the apparatus without electrodes on his arm. 
A telegraph key was placed in front of him 
to the right of the exposure device. He was 
asked to indicate each time he saw a card 
which he believed would have resulted in 
shock if the electrodes were in place, by press- 
ing the key. He was also asked for a state- 
ment of degree of certainty: one, very 
uncertain; two, fairly certain; and three, 
positive that the card in question is a shock 
card. No cues as to right or wrong were 
given in this phase. 

Record was kept of the cards on which 
each S expected shock as well as the degree 
of certainty with which it was expected. 
During Phase DT, record was kept of card- 
sorting errors as some indication of how well 
S had mastered the concepts. While to some 
extent the number of card-sorting errors was 
limited by the correction method used, they 
do give some indication of mastery since the 
number of errors included those made on the 
second trial. 

Table 2 summarizes the order in which the 
three groups were run through the experi- 
ment. Group 1 had differential training, 
then 40 presentations with eight negative 
reinforcements, and finally the test of ex- 
pectancy. Group 2 was first presented with 
the 40 trials and eight negative reinforce- 


TABLE 2 


SUMMARY OF EXPERIMENTAL SEQUENCE FOR 
Groups I, 2 AND 3 


PHast SEQUENCE 


Group 
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ments, then given the differential training, 
and finally the test of expectancy. Group 3 
was treated the same as group 1 with the 
exception that it did not receive the initial 
differential training. 

By this method it was possible to test for 
the effect upon expectancy of differentiations 
occurring prior to negative reinforcement as 
compared with the effect of differentiations 
occurring after negative reinforcement. The 
relative effect of differentiation, regardless of 
where it occurs, was also subject to test. 


REsULTs AND Discussion 


The test of the hypotheses is made in the 
test phase by comparing groups in terms of 
the number of button presses, these being 
taken as indicators of expectancy. The hy- 
potheses may now be stated symbolically in 
terms of frequency of expectancy of shock as: 


I. 3>1 
2. 2>1 
3. 3>2 


The null hypotheses to be tested become: 


1. There is no difference in expectancy of shock 
between 3 and 1 other than chance variation. 

2. There is no difference in shock expectancy be- 
tween 2 and 1 other than chance variation. 

3. There is no difference in shock expectancy be- 
tween 2 and 3 other than chance variation. 


Table 3, in which are summarized the fre- 
quencies of shock expectancy during Phase 
III (Test), demonstrates a rejection of null 
hypotheses 1 and 2, no rejection of null 
hypothesis 3. F for this table is 12.815, 
p<.oo1. The difference in the means for 1 
and either 2 or 3 is significant at less than 
the .oor level. Differences in the means for 
2 and 3 are not significant at or below the 
05 level (Table 4) (3). 

In other words, negative reinforcement 
occurring in an inadequately differentiated 
field results in greater expectancy of negative 


TABLE 3 


Frequency oF Expectancy oF SHock Durinc 
Puase III (Test) 








Group Tora MEAN SD 





Tr 
DT 
Tr 


79 
187 
171 
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TABLE 4 


DirFERENCES IN MEAN EXPECTANCY FOR 
Turee Groups 








Groups DIFFERENCE p 





<.0oo1 
<.001 
<.60>.50 


4-595 
4-292 
0.664 





reinforcement (in terms of frequency of pre- 
diction of “punishment”) than does negative 
reinforcement occurring in an adequately 
differentiated field. It also results in less 
accuracy of prediction, and in less verbalized 
certainty of prediction. (“Certainty of pre- 
diction” is defined in terms of E’s instruc- 
tions to Ss, and Ss’ responses.) 

The notion of “interference from anxiety” 
seems to parallel the results most accurately; 
but, because of the mild nature of the nega- 
tive reinforcement in this experiment, as well 
as the relative sophistication of the Ss, the 
introduction of such a construct seems 
inappropriate. 

Table 5 illustrates the fact that Group 1 
actually showed the least stimulus generali- 
zation (i.e., the subjects distinguished more 
accurately between shock and _ nonshock 
stimuli). 


TABLE 5 


Correct EXxpecTANCIES IN THE TEST PHASE 


SD SE 








Group Tora MEAN 





39 2.60 
21 1.40 
23 1.53 


. 880 





Table 6, for which F is 8.255 (p<.oor), 
lists differences in correct expectancies for the 
groups, with probabilities. 

If the experiment is viewed as a discrimina- 
tion problem, which seems logical, it may be 


TABLE 6 


DiFFERENCES IN CorrECT EXPECTANCIES 


Groups DIFFERENCE 





1.20 
1.07 


-13 <.70>.60 
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said that Group 1, as compared with Group 2, 
received no different amount of reinforce- 
ment for correct as opposed to incorrect 
choices in the differentiation phase of the 
training. This is demonstrated in Table 7. 


TABLE 7 


Errors 1n Carp SorTING IN THE DIFFERENTIATION 
PHASE 


SD SE 


MEAN 


Group Torat 





1.069 
+930 


216 14.4 
233 15-5 


3-999 
3-502 





However, the point at which these differen- 
tiations were made available for the groups 
differed: for Group 1, preshock; for Group 2, 
postshock. 

Table 8 presents the distribution and per- 
centage of expectancies by concept and com- 
pares these with the actual percentage of test 
cards belonging to each concept. 


TABLE 8 


Percent or Totat Expectancits Faruinc Into 
Eacu oF THE Concept CATEGORIES 





Carps in Eacn 





Concept 1 


N 








Color 6 08 43 22 
Form 22 28 61 ss. 2 24 
Number E an, a ae ae 
Misc. 39 49 #21 a 


14.364 
<.01 


x? 263.813 
Pp <.001 


3.871 
>.20 





Since in Groups x and 2 there was equal 
exposure during differential training in the 
differentiation phase to the four concept cate- 
gories and in Group 3 there was lack of 
differential exposure to any one of the four 
concept categories, any deviation in the dis- 
tribution of expectancies for a group from 
the distribution of available cards in each of 
the categories must be considered to be due 
to the effect of differential reinforcement in 
the training phase. In other words, if train- 
ing were omitted, one would expect the dis- 
tribution of expectancies by category for each 
of the groups to conform to the actual per- 
centage of cards available for them to react 
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to in each of the categories. According to 
the chi squares in Table 8, and accepting the 
foregoing assumption, one may conclude that 
the differential reinforcement in Phase Tr 
had no significant effect on the distribution 
of expectancy reactions for Group 2. 

At a more general level, one may speculate 
that each reinforcement gave S information 
about the hypothesis he was using for that 
particular card sort. Thus, in Group 1, the 
differential training in Phase Dt made certain 
hypotheses available for testing in Phase Tr. 
The reinforcement occurring in this phase 
tended mainly to confirm the miscellaneous 
hypothesis. Members of Group 2 did not 
have pertinent hypotheses available during 
Phase Tr, but formulated what hypotheses 
they could. During Phase Dt these hypothe- 
ses were found not to be among the relevant 
hypotheses. Since no further opportunity 
was presented to test any of the available 
hypotheses resulting from Phase Dt, the sub- 
jects in Group 2 adopted and dropped them 
in random fashion. Group 3, while not hav- 
ing the correct hypothesis available at any 
time during the experiment, also had no 
clear-cut basis for rejecting whatever hypothe- 
ses they may have been trying in Phase Tr. 
Apparently, from the data in Table 6, 
Group 3 had been working predominantly 
on hypotheses which were related to cards 
falling mainly in the number and miscel- 
laneous categories. Such an hypothesized 
process at least parallels the facts of Group 3’s 
deviation. 

Finally, in any experiment utilizing human 
subjects, the problem of motivation and other 
uncontrollable variables must be considered. 
Having placed the subjects in each of the 
three groups in random fashion and having 
given each subject the same instructions, it 
can only be assumed that differences in moti- 
vation between groups were not sufficiently 
great to account for the differences found in 
expectancy or learning. 

Because the situation was not equally dif- 
ferentiated for each of the groups, and be- 
cause individuals in general would rather be 
right than wrong, it was believed that the Ss 
for whom the situation was not as well differ- 
entiated might be reluctant to indicate many 
of the times when they expect shock. In 
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order to make the situation as permissive as 
possible, and to indicate to an S that he was 
not expected to be perfectly sure of each of 
his expectancies, a procedure was introduced 
whereby S indicated how certain he was of 
each of his expectancies. Besides making the 
situation more permissive, it was hoped that 
in this way some qualitative information 
might also be obtained. Table g presents the 


TABLE 9 


PercENTAGE oF ToTtaL Expectancies FaLiinc INTo 
THE Various CATEGoRIES OF CERTAINTY 








Decree oF CERTAINTY 





34% 
38%, 


34% 





distribution of expectancies by degrees of 
certainty. Category I indicates the least de- 
gree of certainty and Category III the great- 
est degree of certainty. 

In terms of degree of certainty, 46 per cent 
of the choices of Group I (differential train- 
ing before general training and shock) were 
at Level III of certainty; where differential 
training followed general training, as in 
Group 2, the percentage of Level III responses 
dropped to 33 per cent; and where no train- 
ing phase was given at all, as in Group 3, the 
percentage of “certain” responses dropped to 
19 per cent. Conversely, only 20 per cent of 
Group 1 choices were relatively tentative 
(Level I), while 29 per cent of Group 2 and 
47 per cent of Group 3 choices were tentative. 
In other words, the group introduced into 
the general training (phase 2) with some 
tentative modes of problem solving already 
presented to them were more sure of their 
judgments than a group whose training 
phase was followed by the differential train- 
ing phase 2; while the group with no differ- 
ential training were least certain of their 
decisions. An over-all chi-square test for 
Table 9 yields a value of 28.698, which for 
four degrees of freedom results in p<.oor. 

These findings are in general in line with 
those which do not consider the qualitative 
aspects of expectancy, as reported earlier in 


the paper. 
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SUMMARY 


An attempt has been made, through experi- 
mental use of the Wisconsin Card Sorting 
technique (2) and patterns of differentiation- 
training and negative reinforcement, to de- 
termine the effects of negative reinforcement 
introduced into an adequately differentiated 
situation as contrasted to an inadequately 
differentiated situation. 

In general, later learning (judged from 
accuracy of expectancy) is more efficient for 
the former than for the latter condition. 
Indeed, a group where negative reinforce- 
ment was introduced prior to differential 
training showed no greater accuracy in sub- 
sequent predictions of negative reinforce- 
ment than did a group where no differential 
training whatever was given. Further, the 
group which had first received differential 
training, then negative reinforcement, ex- 
pected significantly fewer shocks in a test 
situation than did a group given first nega- 
tive reinforcement, then differential training, 
then a test; or a group given only negative 
reinforcement followed by test. 

The results of the test of the experimental 
hypothesis have implications for the fields of 
clinical psychology and child development, 
provided two assumptions are made. If the 
findings may be generalized from the college 
student population on which they are based, 
and insofar as the experimental concepts 
parallel or are analogous to the clinical-social 
concepts, then one may by successive stages 
of induction arrive at the stricture, “Give the 
child techniques for solving his problems.” 
Similar inductive steps would lead to the 
axiom relating the “toughness” of childhood 
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traumas to difficulties in later adaptive learn- 
ings. Implications for adult experience are 
also present: orientation programs (¢.g., to 
college, hospital, exam sessions) are analogi- 
cally similar to this experimental situation. 
Again, analogically, the susceptibility of the 
average boy or girl to sexual trauma (the area 
of sex being poorly differentiated, on the 
average) might be thought of in light of 
these experimental findings. 

However, the authors wish to repeat that 
this experiment is one of analogues, not of 
rigorous derivations: the social-clinical con- 
cepts are broad and loose, operationally 
speaking; the experimental operations defin- 
ing the same concepts are much narrower 
and tighter; and generalization from the 
narrow and relatively tightly defined experi- 
mental concept to the broad and loosely 
defined social-clinical concept is, at the best 
tentative, and at the worst, misleading. 

Finally, the authors feel that this attempt 
to clarify the relationships between certain 
pragmatically grounded clinical and child 
development hypotheses and hypotheses de- 
rived from learning theories has been mod- 
erately successful, and perhaps more useful 
than either approach followed alone. 
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ETHNOCENTRISM, “INTOLERANCE OF AMBIGUITY,” AND 
ABSTRACT REASONING ABILITY * 
PATRICIA O'CONNOR 
Radcliffe College 


His paper presents the results of an ex- 

periment designed to investigate dif- 

ferences in abstract reasoning ability 
among individuals of varying degrees of eth- 
nocentric prejudice. The study also explored 
the relation between “intolerance of ambigu- 
ity” and the ability to reason abstractly as 
well as the relation between ethnic prejudice 
and “intolerance of ambiguity.” Interest in 
the problem derives from the evidence that 
ethnocentric prejudice is associated with dif- 
ferences in perception and cognition. Percep- 
tual perseveration is positively correlated with 
ethnocentrism (3) and “intolerance of am- 
biguity” (10) as measured by attitude scales. 
In problem solving, prejudiced subjects tend 
to show rigidity (5, 7) and concreteness (7). 
They are also less adept at forming the con- 
cept of size as a basis for grouping objects 
and at reasoning deductively in problems in 
which there is more than one correct an- 
swer (5). 


ExpERIMENTAL PROCEDURE 


Subjects 


Subjects (Ss) for the experiment were 
seventy-seven Harvard undergraduates en- 
rolled in an introductory course in social 
relations. Administered by the instructors, 
the questionnaires were answered anony- 
mously by groups of twelve to twenty stu- 
dents in section meetings. Results of the 
study are based on the responses of fifty-seven 
Ss; questionnaires of Ss having previous ex- 
perience with syllogistic reasoning were dis- 
carded before the data were analyzed. 


Questionnaire Measures 


The experimental questionnaire has three 
parts: 15 syllogisms, each consisting of two 
premises and three conclusions, Ss were to 


1 The writer expresses her deep appreciation to Dr. 
R. D. Walk under whose direction the experiment was 
conducted and the original thesis prepared. She also 
thanks Dr. G. Lindzey for his many he'pful suggestions 
in the writing of this paper 


mark each conclusion as valid or invalid in 
terms of the stated premises; a social atti- 
tudes scale measuring ethnocentrism and “in- 
tolerance of ambiguity”; questions as to col- 
lege grades for the previous year (midsemes- 
ter marks for freshmen), and religious pref- 
erence. 

The syllogism scale, used as a test of one 
kind of abstract reasoning ability, was taken 
from Wilkins’ (13) study of the effect of 
changed material on the ability to reason 
syllogistically. Five logical forms each ap- 
pearing three times with different contents 
were employed. The order was random with 
the two restrictions that no two syllogisms of 
the same form or type of content follow each 
other directly. The contents were of three 
types: familiar, symbolic, and “tricky.” Re- 
sponses to each conclusion were graded as 
correct or incorrect and scores computed for 
each of the three types of content and for 
the total scale. Reproduced below is one set 
of syllogisms. 


(Familiar) Some people in the audience are 
laughing. Some children are in the audience. 
Therefore 

(a) some children are laughing; 
(4) some of the people laughing are children; 
(c) some children arc not laughing. 


In the familiar material the objects in the 
content are familiar, although the facts are 
not within the actual experience of the sub- 
jects. 


(Symbolic) Some a’s are b’s. Some c’s are a’s. 
Therefore 

(a) some c’s are b’s; 

(4) some b’s are c’s; 

(c) some c’s are not b’s. 


The symbolic syllogisms are of the same 
form as the familiar, but the content consists 
of letters rather than words. 


(Tricky) Some dogs have long hair. Some pet 
animals are dogs. Therefore 

(a) some pet animals have long hair; 

(4) some long-haired animals are pets; 


(¢) some pet animals do not have long hair. 
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In the tricky material, the truth or falsity 
of the statements in terms of experience is at 
variance with the logical validity in terms of 
the premises. In the example, the conclu- 
sions, correct in the experienced world, are 
invalid deductions from the premises. 

The social attitudes scale consisted of the 
eighteen items from the final revision of the 
California E (Ethnocentrism) scale, measur- 
ing anti-Negro, anti-Semitic, and chauvinis- 
tic attitudes and Walk’s (10) scale measuring 
“intolerance of ambiguity.” For the present 
study, in which E and A scale results were 
correlated, retaining the scales in their stand- 
ard wording so that agreement on all items 
represents ethnocentrism and “intolerance of 
ambiguity” could produce a false correlation 
between the variables measured. For this 
reason some items on each scale were changed 
so that agreement would indicate anti-ethno- 
centrism and “tolerance of ambiguity.” Re- 
sponses were scored on a seven-point scale 
according to direction and intensity of opin- 
ion, high scores representing ethnocentrism 
and “intolerance of ambiguity.” For Ss with 
a Jewish religious preference, E scale results 
were based on the scales measuring anti- 
Negro attitudes and excessive patriotism, 
since for Jewish Ss scores on the anti-Semitic 
scale could not be interpreted as measuring 
outgroup prejudice. 

The A scale, unpublished elsewhere, is re- 
produced below. In the questionnaire itself, 
E scale and A scale items were arranged at 
random. 


1. There is more than one right way to do 
anything. 

2. It is always better to have a definite course 
of action than to be vacillating among several 
possibilities. 

3. The best leaders give specific enough instruc- 
tions so that those under them have nothing to 
worry about. 

4. A smart person gets his life into a routine so 
that he is not always being bothered by petty 
details. 

5. Nobody can have feelings of love and hate 
toward the same person. 

6. It is better to keep on with the present method 
of doing things than to take a way that might lead 
to chaos. 

7. A man can be well informed even if there 
are many subjects upon which he does not have a 
definite opinion. 

8. It is better to take a chance on being a failure 
than to let your life get into a rut. 


REsuULTs 


Statistical Analysis 


In statistical treatment of the data it was 
decided to accept hypotheses indicating a re- 
lationship of a magnitude that would occur 
less than one time in twenty as a function of 
sampling variation. Since the hypotheses were 
one-directional, significance at the .05 point 
was required for rejection of the null hypoth- 
esis. 


Propositions 


Hypothesis 1. Ethnocentrism is positively related 
to “intolerance of ambiguity.” 


Scores on the E and A scales show a posi- 
tive correlation that is statistically significant 
(r=.55; P<.05). Similarly a ¢-test indicates 
that Ss above the median in ethnocentrism 
are more “intolerant of ambiguity” than those 
below the median (#=4.00; p<.05). The 
hypothesis was accepted. 


Hypothesis 2. Ethnocentrism is related to a poor 
ability to reason abstractly. 


Since syllogistic reasoning ability correlates 
positively with intelligence (13), it is neces- 
sary that any tendency intelligence has to 
vary with ethnocentrism be controlled when 
interrelating E scale results and syllogism test 
scores. College marks were used as a measure 
of intellectual ability. A correlation of —.24 
was found between high grades and ethno- 
centrism. This is similar to the r=—.2 re- 
ported by Levinson (1, pp. 280-284) as a 
fairly consistent correlation between E scores 
and IQ test results. All parts of the syllogism 
scale correlated positively with high grades; 
therefore, partial correlation was used to con- 
trol the effects of the high grades-low preju- 
dice relationship. It was unnecessary to con- 
trol the variable of academic grades in the 


TABLE 1 


CoRRELATIONS BETWEEN E Scores AND SYLLOGIsM 
Scores witH A Scores Heip Constant (N=57)* 





ConTENT r Pp 





Total 368 2.94 <.05 
Familiar .318 2.48 <.05 
Symbolic -351 2.63 <.05 
“Tricky” .220 1.67 <.05 





*A t of 1.65 is required for significance at the .o5 
point. 
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t-tests because “high E’s” and “low E’s” were 
almost identical in mean rank group. 

Partial correlation was also used to ascer- 
tain what the relationship between ethnocen- 
trism and abstract reasoning ability would be 
with “intolerance of ambiguity” held con- 
stant. As Table 1 shows there is a significant 
relationship between ethnocentrism and a 
poor ability to reason abstractly. The hypoth- 
esis was accepted. 

Hypothesis 3. “Intolerance of ambiguity” is re- 
lated to a poor ability to reason abstractly. 

Since either by correlation or ¢-tests A score 
does not vary with college grades, no correc- 
tion was made for this factor in analysis of 
data. 

The correlation between syllogism score 


Patricia O’ConNoR 


subjects who are not high in both E-scale 
and A-scale scores tend to be very similar 
as groups in their performance on the syl- 
logism test. There is no indication from the 
data that “intolerance of ambiguity” apart 
from ethnocentrism is associated with a poor 
ability to reason abstractly. The hypothesis 
is therefore rejected. 


Hypothesis 4. Ethnocentrism is related to a 
tendency to be less successful in dealing with cog- 
nitive tasks that require an abstract orientation 
than with problems requiring a more concrete 
orientation. 


To test this assumption, Ss’ scores on the 
familiar material were compared with their 
performances on the symbolic and tricky ma- 
terial by a somewhat involved statistical pro- 


TABLE 2 


DirFERENCE IN MEAN SyYLLOGISM ScorE FOR SUERJECTS ABOVE AND BELOW THE MEDIAN IN E anp A Scores 
(N=57) 
7 








Groups N MEAN Score 


Mean Dir. 





Low E-Low A 
High E-High A 


Low E-High A 
High E-High A 


High E-Low A 
High E-High A 2 


30.32 
25.05 
29.50 
25.05 


30.71 
25.05 


5.27 1.96 


4-45 2.84 


5.66 2.88 





*Aztof 2« 


and A scores indicates “intolerance of am- 
biguity” is not related to ability to reason 
abstractly if E scores are held constant. The 
direction of relationship is that predicted for 
the total and tricky material but not for the 
familiar and symbolic contents. Correlations 
vary from —.12 (tricky material) to +.04 
(symbolic) and do not approach statistical 
significance in any instance. 

The Ss were divided into four groups on 
the basis of having E scores and A scores 
above or below the median. Comparisons 
were made among high E’s-high A’s, high 
E’s-low A’s, low E’s-high A’s and low E’s- 
low A’s. Ss who are above the mean in 
both ethnocentrism and “intolerance of ambi- 
guity” are significantly poorer in abstract rea- 
soning than those below the mean in both 
measures. The high E’s-high A's do not differ 
significantly from the other two subgroups, 
but the trend is marked (Table 2). The 


5 is required for significance at the five per cent level of confidence. 


cedure. Each S’s scores on the symbolic and 
tricky materials were subtracted from his 
score on the familiar material. The result- 
ing sets of figures were correlated with E- 
scale scores. Performance on the familiar 
content was held constant by partial correla- 
tion. since Ss who had high scores on the 
familiar material tended to lose more points 
as compared with the other Ss. By this means 
it was tested to what extent a poorer perform- 
ance on the symbolic or tricky material as 
compared with the familiar material is asso- 
ciated with ethnocentrism. 

Ethnocentrism correlates .15 (t=1.08) with 
a relatively poor performance on the symbolic 
material and .18 (t=1.47) with a compara- 
tively low score on the tric!:y material. Since 
a t of 1.65 is required for significance at the 
.05 point, our hypothesis must be rejected. 


Hypothesis 5. “Intolerance of ambiguity” is 
related to a tendency to be less successful in deal- 
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ing with cognitive tasks that require an abstract 
orientation than with tasks requiring a more con- 
crete orientation. 


“Intolerance of ambiguity” is positively re- 
lated to a comparatively low score on the 
symbolic material (r=.09, t==.69) and on the 
tricky material (r=.18; t=1.36). The hy- 
pothesis is not supported to a degree reaching 
statistical significance and is thus rejected. 

Although they do not reach statistical sig- 
nificance, the tendencies for ethnocentric sub- 
jects and subjects who are “intolerant of am- 
biguity” to perform less successfully on the 
abstract than on the concrete material is in 
the direction hypothesized in all cases. For 
this reason, the rejections of hypotheses 4 
and 5 are tentative ones. It would be desir- 
able to set up experiments with other kinds of 
cognitive tasks as further tests of the hypoth- 
eses, since our results are too vague to allow 
definite conclusions. 


Discussion 


Results of this study indicate that, as an- 
other investigator (3) has found, ethnocen- 
trism is associated with “intolerance of am- 
biguity.” They also substantiate the idea that 
ethnocentric people have more difficulty in 
reasoning abstractly than. less prejudiced 
people of comparable intelligence (5). Not 
supported, however, is the notion (3, 7) that 
the element in ethnocentrism related to poor 
abstract reasoning ability is “intolerance of 
ambiguity.” 

Goldstein and Sheerer’s (4) work had 
shown that brain-injured patients are char- 
acterized both by rigidity and an inability to 
adopt the abstract attitude. To Rokeach (7) 
this served as a rationale for predicting that 
since prejudiced people were “rigid” they 
would also be “concrete.” In finding that 
ethnocentrism is related to rigidity (a form 
of “intolerance of ambiguity”), and concrete- 
ness to prejudice and rigidity (7, 8, 9), he did 
not clarify more precisely the interrelation- 
ship among the three variables. 

From the present study it would seem that 
“intolerance of ambiguity” is related to poor 
abstract reasoning ability only when it is asso- 
ciated with ethnocentrism. Thus the aspect 
of ethnocentrism relevant to this problem is 
one different from “intolerance of ambigu- 


’ 


ity.” Frenkel-Brunswik (3) has pointed out 


that in prejudice the criterion of judgment is 
generally an external one, such as color. “Ex- 
ternal” is similar in meaning to perceptual 
or concrete. That is, people who accept or 
reject others on the basis of the ethnocentrist 
are “abstracting concretely.” 

A suggestion derivable from the present 
study is that it is the concreteness of ethno- 
centrism apart from “intolerance of ambigu- 
ity” that is related to an inability to reason 
abstractly. If individuals were to adopt the 
concrete attitude in both their social relation- 
ships and in their cognitive orientations, eth- 
nocentrism and a poor ability to reason ab- 
tractly would result. The evidence that 
rigidity and concreteness are sometimes asso- 
ciated (7) does not imply that they must al- 
wavs occur together. It is not certain that a 
person who is “intolerant of ambiguity” 
would, of necessity, avoid ambiguities by a 
concrete approach; nor would the concrete- 
ness of ethnocentric prejudice always be a 
manifestation of a generalized “intolerance 
of ambiguity.” It is the argument of this 
paper that the relationship among the three 
variables is: ethnocentrism is associated with 
a relatively poor ability to reason abstractly 
and with “intolerance of ambiguity.” “In- 
tolerance of ambiguity” and abstract reason- 
ing ability are not related apart from their 
common tendency to vary with degree of 
ethnocentrism. 


SUMMARY AND CONCLUSIONS 


In an attempt to investigate the relations 
between abstract reasoning ability, ethnocen- 
trism, and “intolerance of ambiguity,” a test 
of syllogistic reasoning together with attitude 
scales measuring ethnocentrism and “intoler- 
ance of ambiguity” were filled out anony- 
mously by fifty-seven students in a classroom 
setting. 

The results indicate that: 

1. Ethnocentrism is positively associated 
with “intolerance of ambiguity.” 

2. Ethnocentrism is related to a poor ability 
to reason abstractly. 

3. “Intolerance of ambiguity” as measured 
in this study is associated with a poor abil- 
ity to reason abstractly only if accompanied 
by ethnocentrism. 

4. The notion that concreteness in orienta- 
tion and the inability to reason abstractly are 
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related to some aspect of ethnocentrism other 
than “intolerance of ambiguity” should be 
investigated. 

5. There seems to be a tendency for ethno- 
centric subjects and subjects who are “intol- 
erant of ambiguity” to be less successful in 
solving problems that require a relatively 
concrete orientation. This hypothesis has not 
been adequately tested in the present study 


and should be tested by other means. 
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He M function in the Rorschach has 

been shown by past studies to be a rela- 

tively stable function. Hertz reported 
a split-half reliability of the M function as 
.745 (4); Thornton and Guilford (9), one of 
.768 and one of .g19; Vernon (11), one of 
.745- Altus and Thompson (1), using the 
test-retest method, reported a tetrachoric co- 
efficient of reliability of .93, an r of 82. In- 
aningate movement (m) appears to be an 
unreliable measure; Altus and Thompson (1) 
reported a tetrachoric reliability coefficient 
(test-retest) of only .15. The reliability of 
FM, animal movement (5), does not appear 
to have been measured. 

Rorschach (7) claimed that the projection 
of humans in action on his ten ink blots 
was an indication, among other things, of 
intelligence at work. Subsequent investi- 
gators have borne him out. Hertz (4) re- 
ported an r of .259 between M and IQ level 
for junior high school students; had she used 
MA instead of IQ for this group the r un- 
doubtedly would have been higher. Witten- 
born (12) also reports a positive relation be- 
tween the number of M in the Rorschach 
protocols among Yale undergraduates and 
certain measures of aptitude and of intelli- 
gence. Altus and Thompson (1), also using 
undergraduates, reported two r’s, .37 and .41, 
between the number of M projected and two 
measures of verbal intelligence. Tucker (10), 
using 100 neurotic veterans, reports an r of 
.262 between full-scale Wechsler IQ’s and the 
number of M; rather interestingly, the single 
subtest, Similarities, gave an r of .447 with 
the M function, a coefficient which is con- 
siderably higher than for the full-scale IQ. 
Sicha (8) found, however, that very bright 

1 This study was subsidized in part through a grant 


from the Research Committee, Santa Barbara College, 
University of California. 
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adult Negroes gave only about one-half the 
number of M given by a control group of 
whites. So far as is known, this “race” 
differential has not been followed up. Ror- 
schach’s belief in M as a partial measure of 
intelligence appears, therefore, to have been 
vindicated. 

Wittenborn, in his factor analysis of certain 
aspects of the Rorschach, reported that “An 
examination of numerous scatter diagrams 
between a variety of scoring categories gave 
no indication that the relationships were 
nonlinear” (13, p. 261). On the other hand, 
Cronbach claims that “Many of the significant 
Rorschach scores give sharply skewed dis- 
tributions for most populations” (3, p. 405); 
he further says “The regression of W on R 
is definitely curved” (3, p. 413). W and R 
are two of the 20 Rorschach variables used 
by Wittenborn in his factor analysis. Altus 
and Thompson (1) reported a nonlinear re- 
lationship between M and two measures of 
verbal intelligence. Where the number of 
M gave an r of .37 with the Altus Measure 
of Verbal Aptitude, the corresponding eta 
was .54; with the Ofio Psychological Ex- 
amination, Form 21, the r was .41, the eta, 
63. The N for the first set of coefficients 
was 128; for the second, 100. Despite Witten- 
born’s statement relative to his scattergrams, 
there appears to be more than a little dubiety 
about the acceptance of linearity among cer- 
tain significant Rorschach variables. 


Tue Present Srupy 


The present study was designed to inquire 
further into the following problems: (a) 
What type of M brought about the curvi- 
linearity between M and verbal intelligence 
noted by the writers in their previous study? 
(4) Are Beck’s popular M’s(2) so fully 


saturated with verbal intelligence as are the 


3 ace OE GARAR IED ASS AAPL ne Be ie 





532 


more original M’s, i., those which are not 
popular? (c) Is FM related to intelligence 
and to M, and if so, is the relationship linear? 
(d) Does summating M and FM increase the 
observed relationship with an_ intelligence 
test? The population used consisted of 100 
college undergraduates. The group Ror- 
schach was administered to them, following 
Munroe’s directions (6). The obtained r’s 
and correlation ratios are given in Table 1. 


TABLE | 


Some r’s AND ETAS, WITH THE PropaBiLity (p) OF 
, SIGNIFICANT DIFFERENCE BETWEEN THEM 


VARIABLES CORRELATED 





‘ 59 . 100 
Ohio * vs. (M—popular M)... .40 .68 . 100 
Ohio * vs. (M+FM) -33 -§0 . 100 
Ohio * vs. popular ! -I9 .34 . 100 
Ohio ® vs. FM.... 26 9 « 100 





* Ohio Psychological Examination, Form 21. 


It will be noted that the r between M and 
FM is .29; this figure is fairly close to the r 
of .375 found by Wittenborn (13). The re- 


gression of M on FM is, however, definitely 
curved, since eta reaches .59 and the confi- 
dence level for the difference between the two 


correlations is oor. The number of M after 
the removal of the stereotyped or popular 
M gives an r of .40 with the Ohio, a figure 
which is synonymous with the .41 found 
previously for the total number of M, in- 
cluding the populars. The eta for the un- 
stereotyped M and Ohio is .68. Since our 
level of confidence for the difference between 
the two types of correlations is here .oo1, it 
may fairly definitely be assumed that the 
regression of the unstereotyped M on intelli- 
gence is nonlinear. It will be noted in the 
table that the r and eta for popular M versus 
Ohio are .19 and .34, respectively; the dif- 
ference between the two correlations is not 
significant. The unstereotyped M is obvi- 
ously the better measure of intelligence; and 
the relationship is significantly curved, while 
that of the relatively stereotyped M appears 
not to be. It must be admutted, however, 
that the range of the poular M is not so great 
as for the M which is not popular. This dif- 
ference in range may account for the low 
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r of popular M with Ohio and the lack of 
significant curvilinearity between the two 
variables. 

The sum of M+ FM yields an r of .33, 
an eta of .50, when correlated with the Ohio. 
The addition of FM to M tends to rule out 
the certainty of curvilinearity of regression; 
and, likewise, it tends to reduce the saturation 
of M with intelligence. FM by itself gives 
an r of .19, an eta of .35, with the Ohio— 
the difference between the coefficients being 
of no statistical significance. The m, or inani- 
mate movement, variable gave too norrow 
a range to justify the use of an r; conse- 
quently, a tetrachoric coefficient (graphic es- 
timation) was calculated between m and 
Ohio: It came to .16. The hierarchy of the 
movement function in the Rorschach when 
scaled in terms of its association with intelli- 
gence for college students appears to be as 
follows: (a) the unstereotyped M (M— 
popular M) yield the highest coefficient; (4) 
M, or the sum of stereotyped and unstereo- 
typed M, is next; (c) FM, popular M, and m 
all yield positive, though low, correlations. 

The study by Tucker (10) mentioned pre- 
viously might appear to offer some disagree- 
ment with the findings reported above. He 
found that the sum of FM and m correlated 
.350 with the full-scale IQ’s of 100 male neu- 
rotics, while the corresponding r for M is 
only 262. The writers suspect that the rela- 
tively poor showing of M in Tucker’s study 
is due to (a) the low mean number of M 
for his group, 1.81; and (4) the fact that 
85 per cent of his group had such a narrow 
range of M, zero to three. The relatively 
good showing of FM+-m is doubtless a func- 
tion of (a) the higher mean and sigma for 
this variable, and (4) the wider spread of 
certain scores, 87 per cent falling into the 
zero to five range. It may well be, also, that 
for neurotics FM is as good a measure of 
intelligence as is M since neurotic tendencies, 
according the Klopfer (5), appear to corrupt 
or debase the M tendency into an FM one. 


SUMMARY 
1. One hundred college undergraduates 
were administered the Group Rorschach, ac- 
cording to Munroe’s directions (6), and the 
Ohio Psychological Examination, Form 21. 
A previous study (1) had shown the M func- 
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tion to be the best measure of intelligence in 
the Rorschach, yielding an r of .41 with the 
Ohio, an eta of .63 with the same criterion. 
The regression of M on intelligence was sig- 
nificantly curvilinear for this group. 

2. The present study shows that the curvi- 
linear regression of M on intelligence is, ap- 
parently, a function of the unstereotyped M. 
Popular M yields an r of .19, an eta of .34 with 
the Ohio; the unstereotyped M (total M minus 
popular M) yields an r of .40, an eta of .68. 
The latter coefficient is significantly nonlinear 
(.oo1 lc.) while the eta for popular M is not 
significantly divergent from its r. 

3. FM correlates .19 with Ohio, while the 
eta is .35, a statistically nonsignificant dif- 
ference. The r of FM vs. M is .29; the eta 
is .59; the regression of M on FM is definitely 
curvilinear, the level of confidence being .oot. 
FM appears, therefore, to be somewhat re- 
lated to intelligence, probably in a linéar 
fashion; it is fairly closely associated with 
the M function, apparently in a nonlinear 
way. 

4. When the sum of M and FM is corre- 
lated with the Ohio, the obtained relationships 
(r of .33, eta of .50) are lower than for M 
alone. 

5. The clue to the saturation of M with 
intelligence appears to lie in the unstereo- 
typed, somewhat “original” M; popular M, 
FM and m, while correlating positively with 
intelligence, do so at a relatively low degree. 
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VEN nominal consideration of the ex- 
tensive literature in group therapy (11) 
suggests that this procedure has certain 

emotional consequences for the clinical pa- 
tient. Among these can be said to be facilita- 
tion of the participant’s ability to communi- 
cate his feelings with regard to past or present 
interpersonal relations (1, 12) and the facili- 
tation of identification with other members 
of the group and with the group therapist 
(1, 10, 15). The definition of identification 
to be used in this paper will be that of Gard- 
ner Murphy: 

Tendency to view oneself as one with another 

person and act accordingly (13, p. 989). 


Finally, a presumable third effect of group 
therapy, if the other two mentioned do in- 
deed obtain, might be facilitation of the indi- 
vidual’s clinical insight into the personality 
dynamics of other persons. Insight with re- 
gard to others will be used here, in an adapta- 


tion of a definition by Dymond (5, p. 228), to 
imply both the intellectual and emotional 
realization by the subject of the circumstances 
and relationships out of which the behavior 
of others has arisen. 

If these effects are obtainable for clinical 
patients in group therapy sessions, there 
seems no a priori reason to assume that they 
could not also be obtained for normal sub- 
jects in an experimental group. Through 
use of a control group, it might then be pos- 
sible to isolate variables in the group process 
most closely associated with the obtained con- 
sequences. 


Tue ProsLeM 


This study is, therefore, an investigation 
of conditions relevant to development of com- 


was derived 


* This study from a doctoral disserta- 
tion (2) completed at the University of Michigan in 1949, 
under the guidance of Dr. George A. Satter, Dr. Harold 
Guetzkow, and Dr. Donald G. Marquis. The writer 
wishes to acknowledge his indebtedness to these persons, 
whose cooperation and friendly interest made the project 
possible, while at the same time the writer assumes 
full responsibility for the present form of this study. 
The writer would also like to express his gratitude to 
Dr. Wilbert J. McKeachic, for his invaluable assistance 
in the design and execution of the thesis project, and 
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munication of feeling, identification and clin- 
ical insight in the small face-to-face group. 
The immediate problem was the determina- 
tion of differential effects on these variables 
of two contrasting group leadership tech- 
niques—group-centered, akin to the activity 
group therapy of Slavson (14), and leader- 
centered, patterned after traditional college 
classroom procedure. The differential effects 
of these techniques on perception of an ob- 
jective stimulus (3) and on interpersonal af- 
fect (4) in the small face-to-face group, have 
been discussed elsewhere. 

Specific hypotheses were: That discussion 
of a problem in human relations in a group- 
centered unit, compared to a leader-centered 
unit, would 

1. be more oriented towards analysis in 
terms of feeling rather than towards objective 
analysis; 

2. contain more references to the patient 
whose problems were being considered, thus 
indirectly showing more identification with 
the patient; 

3. evidence on the whole more understand- 
ing of the personality dynamics involved in 
the problem. 


METHOD 


Apparatus and administrative procedure. It was 
suspected that the deeper emotional consequences 
of group process could best be brought out by 
letting two comparable units, one group-centered 
and the other leader-centered, mull over a striking 
problem in the area of personality, presented 
through a film. 

It was thought, following an hypothesis suggested 
by Hutt (9), that the group member would deem 
it safer to express some of his deeper feelings in 
regard to a concrete, external problem that the 
whole group was discussing with him, than he 
would either in a private interview with an experi- 
menter or on an anonymous questionnaire. 

For this reason the 16 mm. sound film, “The 
Feeling of Rejection,” produced by the National 
Film Board of Canada, was shown to the two units 
involved in this experiment, after each had met for 
a total of 39 hours. Following the showing, each 
group was asked to discuss the problem of “what 
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made the girl in this film the way she was.” The 
teacher, who was the same for both sections, took 
no part in either discussion. 

Each discussion was transcribed on a wire re- 
corder for approximately 16 minutes. The exact 
time covered by each transcription was 16 minutes 
and 5 seconds for the group-centered-unit discus- 
sion, and 15 minutes and 29 seconds for the leader- 
centered-unit discussion. 

The Educational Film Library Association’s sum- 
mary of the film follows: 


The case of Margaret who at 23 had not yet learned 
to make decisions independent of her mother. When 
she goes to a psychiatrist to learn the reasons for her 
headaches and tired feeling, she reveals that when a 
child she was afraid of losing the love of her parents 
and, as a result, learned to acquiesce to all their 
demands. When she realizes the cause of her trouble, 
she begins to assert herself and become well-adjusted. 


Design. Two sections from the elementary psy- 
chology course in the 1948 spring semester at the 
University of Michigan were chosen for the ex- 
periment. Each section had 25 students, mostly 
sophomores. Of these, 22 were present in the 
group-centered section on the day of the film 
presentation, and 16 in the leader-centered. The 
number who actually participated in the discussion 
in each case was 12 in the group-centered, and g in 
the leader-centered section. Thus 57 per cent of 
those present in the group-centered class, and 56 
per cent of those present in the leader-centered 
class, took part in the discussion. 

The sections were roughly matched for score on 


the Otis Self-Administering Test of Mental Ability, 
higher form A (20 minute test period); grade point 
average for the previous semester, number of vet- 


erans, and number of girls (2). The respective 
figures for group- and leader-centered sections 
were: Otis form A, 54.0 and 55.0; grade point 
average, 2.37 and 2.81; number of veterans, 14 and 
11, and number of girls, 6 and 2. As can be seen, 
the only really marked difference between these 
units in original composition was the number of 
girls. 

Both sections had the same teacher. The course 
content, assignments, examinations, and relative 
amounts of time devoted to lecture, discussion, and 
psychodramas were kept the same for both sections. 
The two sections met for one hour three times a 
week during the 15-week semester. 

The procedural differences between group- and 
leadercentered technique have been discussed in 
detail in a previous paper (4) and need only be 
briefly summarized here. 

1. Interaction. In the group-centered process, member- 
to-member verbal interaction was encouraged by a 
number of specific techniques, including circular seating 
and deflection of leader-directed questions back to the 
group. In leader-centered process, such interaction, 
when it did spring up, was gently but firmly discouraged, 
and verbal interaction was largely channeled between 
the leader and individual members. 

2. Group decision. In the group-centered process, 
initiative as to a number of decisions concerning day-to- 
day procedure of the class, including timing of examina- 
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tions and length of lectures, was left to the class. 
Whenever the group-centered section came to such a 
decision, the leader merely instructed his leader-centered 
section to follow the same procedure. 

3. Leader's role. In the’ group-centered process, the 
teacher's role was fixed by the group following dis- 
cussion as member of the group, instigator, and resource 
person. In the matched section, the teacher fixed his 
own role as leader rather than member of the group, 
moderator, and resource person. 


Procedural variation. It was important in the 
present experiment to determine whether these pro- 
cedural differences in leadership technique, particu- 
larly in regard to verbal interaction and role 
assignment, really had been obtained as prescribed. 

In regard to interaction, a check made by outside 
observers who recorded one session each of the two 
sections during a three-week period that began 
with the 12th week of the semester, showed that 
the proportion of total member remarks directed 
to other members (rather than to the leader) was 
.61 in the group-centered class, and .10 in the 
matched leader-centered class. The unit of inter- 
action was what one person said at one time, 
regardless of how long he took to say it. The 
standard error of this difference was .o19, and the 
critical ratio obtained, 26.8, indicated that the differ- 
ence could be accepted as statistically significant at 
the 1 per cent level of confidence. 

As for group decisions, the teacher encouraged 
some on matters of day-to-day procedure in the 
group-centered unit, but none in his matched 
leader-centered group, as his daily diary for the 
two sections indicates (2, appendix). 

A check on the required role differences, made 
at the end of the semester by means of an anony- 
mous questionnaire, revealed the somewhat surpris- 
ing information that a majority of the members of 
both sections perceived their teacher as having been 
a member of the group. Further, there was no 
significant difference between the two units in 
rating of the teacher on an 11-point affect scale (4), 
or in evaluation of the teacher's effectiveness (2). 
In each of these three variables, there was a slight 
bias in favor of the group-centered section, but not 
enough to be statistically significant for any one 
variable. All ratings were, of course, anonymous. 

Finally, it should be noted that no member of 
either section checked the questionnaire item, “The 
leader doesn’t give the members of the class enough 
emotional support.” However, 36.4 per cent of 
the leader-centered section checked the item, “The 
leader tends to forget about some people in the 
class,” on the same anonymous questionnaire, com- 
pared to 5 per cent of the group-centered unit. 
This difference is statistically significant at the 
1 per cent level of confidence, and indicates at 
least some difference in the way subjects in the 
two experimental groups perceived their common 
teacher’s respective roles, 

The over-all results suggest that, although the 
actual functioning of the teacher had to be quite 
different in group-centered compared to leader- 
centered process, his role was perceived as much 
the same, with the exception noted above, and he 
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was liked and respected as a teacher to approxi- 
mately the same extent in both groups. 


RESULTS 


Communication of feeling. To test the 
hypothesis that the discussion of the film 
problem by the group-centered unit would 
be more oriented towards feeling than the 
comparable discussion of the matched leader- 
centered unit, an affect rating scale was 
devised for coding of the two discussion type- 
scripts. Points on the scale ranged from o 
for a completely objective analysis,” with no 
feelings expressed, to 4 for a completely feel- 
ing-oriented statement,® with no objective 
analysis present. Units for the coding were 
the participation segments (what a person 
said at one time in sequence) in the two 
typescripts. Results of the coding, presented 
in Table 1, support the initial hypothesis. 


TABLE 1 
Mean Arrect Ratincs For Discussion Participa- 
TION UNITs 


SEaire CR 


Mean, G/C Mean, L/C 





4-59 


Identification. The second hypothesis was 
that the discussion of the group-centered unit 
would contain more references to the girl 
whose problems were being considered in the 
film, thus indirectly evidencing more iden- 
tification with her. This hypothesis was also 
tested by coding. Ideas in the discussion were 
the coding units. As Table 2 indicates, the 
results support the hypothesis. 

Reliability of coding. To obtain reliabil- 
ity estimates of these two codings, a second 
coder was asked to do a check coding on 
both. The reliability between the two coders 
was computed according to formulas de- 
veloped by Guetzkow at the University of 
Michigan Conference Research Project (8). 

J 
The theoretical proportion (Pineor) of agree- 
ment between two coders assumed to have 

2 Example of completely objective analysis from the 
typescripts: “I think that it . . . I don’t know what 
you call it because I haven't studied my terms but I 
think her main trouble is a childhood dependence. . . .” 

8 Example of a completely feeling-oriented statement 
from the typescripts: “I don’t think it was a matter of 
not wanting to do the ironing but she wanted to go 
to the movies too.” 
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TABLE 2 


Proportion OF IpeEA Units Copep 1n CaTEeGoRY OF 
REFERENCE TO PATIENT 








LEADER- 
CENTERED 


Group- 
CENTERED 





n prop. n prop. 





98 .888 59 <.01 


+203 





equal ability is calculated at the lower limit 
possible, given the obtained proportion of 
agreement between Coders 1 and 2, at the 
1 per cent level of confidence (8, p. 53). The 
obtained proportion of agreement is of course 
one sample of the Pieor- Agreement is here 
defined as the proportion of units coded the 
same way by the two coders, to total units 
coded. Using the lower limit of Pryor a 
value for p, the reliability estimate, is derived 
(8, p. 51). This is the least value* p might 
take 99 times out of 100, given the obtained 
agreement, the number of units (n) and 
the number of categories (k). The reliabil- 
ity estimates derived are presented in Table 
3 
TABLE 3 


Copinc PrRocepuREs 


EsTIMATE OF RELIABILITY OF 





RELIABILITY 
Copr 4 





Affect 96 
Reference to 
patient 62 








It will be noted that the reliability for the 
affect scale coding is rather low. To alleviate 
any doubt as to the reliability of the difference 
found between the means for the two sections 
on this scale by Coder 1, the coding of Coder 
2 was used to compute these means, and the 
results compared. This comparison can be 
seen in Table 4. 

Thus although the means on this scale for 
both sections are consistently higher for 
Coder 1 than for Coder 2, nevertheless the 


4It should be noted that when two coders randomly 
classify material into categories, where k equals the 
number of categories, the p becomes 1/k (8, p. 51). If 
the two coders agree completely, then p becomes 1.00. 
Thus p represents the reliability of the coding. 
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difference between means of the group-cen- 
tered unit and the leader-centered unit is 


TABLE 4 


Mean Arrect Ratincs MADE ON PARTICIPATION 
Units sy Two Copers 


Cover Mean,G/C Mean,L/C SEaire 





2.67 1.43 .270 4-59 
-63 -65 .208 4-71 <.01 





about the same for both coders, and the crit- 
ical ratio of the difference between means 
over the standard error of that difference 
is about the same. Therefore despite its low 
reliability, the affect scale coding seems to 
present evidence of a real difference between 
the means of the two sections on the scale. 

Another estimate of the reliability of the 
coding procedure was obtained by comparing 
the actual agreement found between Coders 
1 and 2 with that to be expected purely by 
chance (8, p. 51). The results are shown in 
Table 5. 

TABLE 5 

AGREEMENT OBTAINED BY Two Copers CoMPARED TO 


AGREEMENT EXPECTED BY CHANCE IF UNITS 
Are CATEGORIZED AT RANDOM 





Prop. 
BY CHANCE 


Prop. 


Osr. SEaire CR 


Cope 





Affect 
Reference to 
patient 


.0648 3-35 <.01 


-0764 4.86 <.o1 





Understanding of dynamics. A third propo- 
sition formulated concerning the conse- 
quences of the two kinds of group process 
was that group-centered members would evi- 
dence more clinical insight into the person- 
ality dynamics of other persons than would 
leader-entered members. The specific hy- 
pothesis was that the typescript of the 16- 
minute discussion by the group-centered 
class on the problem, “What made the girl 
in this film the way she was?” would evi- 
dence better clinical insight than would the 
comparable typescript for the leader-centered 
class. It should be recalled here that both 
sections had the same teacher, and had ex- 
actly the same course content, organization, 
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assignments, and role-playing situations, the 
major difference between them being the ex- 
tent to which section members talked to one 
another in class meetings. 

The two typescripts were submitted to two 
clinical psychologists, hereinafter referred to 
as Judges A and B,° for their independent 
evaluation of the degree of clinical insight 
evidenced in each typescript. The evaluators 
first read the synopsis of the film presented 
above and a brochure which included a num- 
ber of stills from the film, together with ex- 
soe comment and selections from the 
dialogue, prepared by the distributors of the 
film, the National Film Board of Canada. 
Neither of the two judges had seen the film. 
They knew nothing of the purpose of the 
experiment and nothing about the nature of 
the two groups that produced the discussion 
typescripts, beyond the fact they were classes. 

Following is Judge A’s evaluation. “First 
recording” refers to the group-centered unit, 
“second recording” to the leader-centered. 


It is a bit like guessing in the dark to attempt 
to evaluate the nature of clinical insight with regard 
to this picture, if one doesn’t know the picture 
thoroughly. However, some things do appear quite 
clear to me: 


First recording. This group is sensitive to the 
expression of feelings, types of mechanisms used to 
deal with conflicts and the varied and inter-related 
aspects of behavior of the “heroine.” The group 
senses much of the problems of an oedipal nature 
and the sibling and social difficulties and has a 
glimmering of the significance of repression and 
denial. 

Most of all, I'd like to comment on the marked 
degree of interaction and spontaneity of the group. 

Second recording. 64° produces strong, nega- 
tive countertransference in me! He dominates the 
group, stifles spontaneity, leads them to intellectual- 
izing and label-pinning. 

The group is insecure, aggressive, and formalistic. 

Little insight is shown by (most) members into 
the underlying dynamics. Major concern is with 
descriptive symptom elucidation and nosological 
considerations. As a group, this group shows far 
less understanding of the “heroine's” difficulties. 


5SJudge A was Professor Max L. Hutt, Associate 
Professor of Psychology and Psychologist in the Neuro- 
psychiatric Institute at the University of Michigan, and 
Judge B was Dr. Daniel R. Miller, Assistant Professor 
of Psychology, and Chief, Clinical Services Division, 
Bureau of Psychological Services of the University of 
Michigan. 

6 This number identifies one of the subjects. Further 
data on the subjects who took part in this experiment, 
including affect scale ratings and notes on daily class 
behavior, can be found in the complete thesis (2). 
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Following is Judge B’s evaluation: 


First recording. 

Better insight. 

Discussed real 
“psychoneurotic.” 

No one dominated the discussion or prompted 
others. 

This group seemed to be less frightened by the 
issues of the film and could take it more seriously 
and less defensively than the other. 

More questions were asked. An attempt was 
made to answer cach by the group as a group. 

I'd suspect the second group were psychology 
students, not the first. If this is true, it corroborates 
my hypothesis that training in psychological theory 
handicaps insightful thinking about people if it 
occurs without personal therapy or in a context of 
detailed case histories. 

Second recording. 

64 was very dominant and assigned the problem. 

64 seemed to need to isolate his feelings, espe- 
cially fear. 

64 set the problem off on an abstract verbal and 
meaningless talk. 

64 seemed to need to avoid insight in the prob- 
lem (his? he says so once) of the picture. His fear 
was displaced to the microphone. 

The group could never get away from labelling 
with “black” names and thus was not enabled to 
cope with issues. 64 always turned the topic back 
to name-calling when others like 54 began to discuss 
issues. Thus he did not interfere when the subject 
changed to self-control which was safe for him. He 
was ably abetted by 51’s topic manipulation. 


issue, not abstract words like 


From the evaluations of Judges A and B, 
the two sections appeared so different after 
39 hours of mainly member-to-member verbal 
interaction in the group-centered, and 39 
hours of mainly leader-member verbal inter- 
action in the leader-centered, that they now 
were scarcely recognizable as having come 
from the same general population of small 
groups. In fact Judge B suggested that the 
group-centered section was not a psychology 
class at all. 

The reports of the two clinical psychologists 
appear to agree that the group-centered sec- 
tion displayed more clinical insight into the 
problem in human relations that confronted 
it. The hypothesis that group-centered lead- 
ership leads to better understanding of per- 
sonality dynamics than leader-centered, is 
therefore supported. 


Discussion 


Interaction and the communication of feel- 
ing. It appears difficult to account for the 
difference in verbal communication of feeling 
obtained solely in terms of the major pro- 
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cedural variable, the difference in amount of 
member-to-member verbal interaction. It has 
been shown previously (4), however, that a 
high degree of member-to-member verbal in- 
teraction in the small face-to-face group, such 
as was obtained in the group-centered unit 
of the present experiment, tends to result in a 
high level of interpersonal affect among 
group members. Affect as here used refers 
only to the liking or disliking of group mem- 
bers for one another, as measured on an 11I- 
point rating scale (4) with minus five for 
extreme dislike, zero for neutrality and plus 
five for extreme liking. 

In an atmosphere of heightened positive 
affect, work in group therapy further sug- 
gests (12, 15), individuals will feel freer to 
communicate their feelings—hostile or 
friendly, as the case may be. 

It can therefore be suggested that verbal 
interaction is not a sufficient condition for the 
communication of feeling. This latter higher- 
order process will only be manifested when 
the delicate threads of positive affect have 
already been established between members of 
the small group, through verbal interaction, 
or by other means. The emotional rela- 
tionship is prior to the communication of 
feeling. 

The positive affect upon which the com- 
munication of feeling is based in a given 
interpersonal relationship may be, of course, 
itself based on unrealistic and distorted per- 
ceptions of that relationship. The present 
argument is that, so long as the affective 
base is there, communication of feeling will 
be facilitated, whether the affective relation- 
ship is anchored in probably true or in autis- 
tic perceptions. 

Insight and role-taking. A direct connec- 
tion between the major independent variable 
of the present experiment, the amount of 
member-to-member verbal interaction, and 
the difference obtained between the two 
groups tested in degree of insight into person- 
ality dynamics, is also difficult to establish. 

It can be suggested, however, that the 
process leading to the increase in clinical un- 
derstanding among group-centered members 
was their heightened empathic ability, as de- 
fined by Dymond: 

. . . the imaginative transposing of oneself into 

the thinking, feeling and acting of another and 

so structuring the world as he does (6, p. 127). 
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Empathy may therefore be definéd as the staff of the University of Michigan then 
ability of an individual to respond to the made blind analyses of both typescripts, inde- 
meaning a particular situation has for an- pendently of one another, to determine the 
other, and it may thus be said to involve the _ relative amount of clinical insight shown by 
accurate prediction of the other’s response to each group into the problem. 
the situation, including symbolic processes, 1. Group-centered process leads to greater 
feeling, and overt behavior. communication of feeling than leader-cen- 


The greater understanding of the group-_ tered. 
centered members may be due to their greater 2. From indirect evidence, it appears that 
ability to take the role of the girl in the film group-centered process leads to more identi- 
whose emotional problems were being pre- fication with another person than leader- 
sented, and to perceive and feel those prob- centered. 
lems from her point of view—from inside as 3. Group-centered process leads to greater 
well as outside. The increase in empathic clinical insight into personality dynamics 
ability for the group-centered section can be than leader-centered. 
related in theory to the greater amount of REFERENCES 
verbal interaction taking place there. Such | pisucu, Dororny W., & Mute, H. Interview 
interaction probably allows the correction of group psychotherapy with allergy patients. In 
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and theoretical, approaches to hostility, 

communication, and international ten- 
sion. It extends the series of studies in this 
area which has been conducted during the 
past three years. This series represents an 
attitudinal approach to international prob- 
lems. 

After having investigated hostility (3) and 
social distance (2) using two independent in- 
ventories administered to the same group of 
college students and having correlated the 
scores on each inventory for the subjects un- 
der consideration, no significant relation- 
ships were found to exist between the two 
measures (4). However, in discussions of 
prejudice, hostility and social distance are 
often taken as synonymous or highly posi- 
tively correlated variables. 

In the study on geo-ethnic preferences it 
was found that the nations from Asia and 
the Middle East were least liked by the sub- 
jects. If current international political dis- 
putes are of a hostile nature, then an anomaly 
remains since those nations against which we 
tend to be most politically hostile are not the 
countries which are most disliked. 

Another problem is apparent in the rela- 
tionship between knowledge and social dis- 
tance. Many programs of attitudinal change 
are based upon the assumption that the more 
one “knows” about another person, the less 
prejudice or hostility he will direct toward 
this other individual. This does not explain 
why individuals who are among the most 
hostile in their reactions toward other na- 
tions often have a great deal of “informa- 
tion” about such cultures. Therefore, we 
rationalize that it is not what the subject 
knows about another culture which deter- 
mines his attitude, but what he thinks he 
knows. 

The question posed for this study is: What 
is the relationship among hostility, social 
distance, and “knowledge” about various 
nations? We suspect that “knowledge” is 
not synonymous with positive attitude, nor 
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hostility with negative attitude, nor “knowl- 
edge” with the absence of hostility. Our 
hunch is that the relationship is more com- 
plex than has heretofore been expected. 

We seek a law which will define for us the 
relationship among the three variables in an 
operational manner and will hold for the 
individual case as well as for any number of 
subjects tested. 

The method used in this study is designed 
to satisfy certain criteria. (a) The test must 
be applicable to a variety of geo-ethnic areas 
and to different problems and subjects. (4) 
The basic statistical design must include the 
following: breadth and depth of possible 
analyses, flexibility of the instrument to per- 
mit the removal of particular variables for 
special study, and provision for both rank 
and scalar scores in order that both direction 
and intensity may be measured. (c) Equiva- 
lence and stability reliabilities must be high. 
(d) An instrument is desired which may be 
used by other investigators with statistically 
similar results. Therefore, a direct and non- 
projective test must be developed. (¢) The 
time required to respond to the test must be 
brief and group administration must be 
feasible. (f) The test must be easily scored, 
by machine if possible, and adapted for use 
by semiskilled persons. 


METHOD 


Three problems, hostility, “knowledge,” 
and social distance (“liking”) are under con- 
sideration. Each constitutes a separate in- 
ventory. The three questions are presented 
in Example tr. 


Example 1 

“T feel more hostile and aggressive 
toward, and would rather go to 
war against —— , 


Hostility: 


“T have more accurate knowledge 


“Knowledge”: 
and information about ———— than 
oa 


“T like and would rather associate 
with the people of ———— than 


Liking: 
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A second part of the scale consists of the 
nations to which the subject is asked to 
respond. These ten nations are chosen as 
representatives of the major social, ethnic, 
and political areas of the world: Brazil, 
Egypt, England, France, Germany, India, 
Japan, Nigeria, Norway, and the U. S. S. R. 

Example 2 indicates the way each scale is 
presented. 


Example 2 

“I feel more hostile and aggressive toward, and 
would rather go to war against ———— than ———.” 
much 
more 


about 
equal more 


much 


more more 


Brazil [V} O O O 0 
This means that the person who checked this “feels 
more hostile and aggressive toward, and would 


rather go to war against Brazil much more than 
Egypt.” 


Egypt 


Matching each nation once with each other 
nation, the inventory consists of 45 paired 
comparisons. The scale question is repeated 
after every nine items. The subject (S) is 
given two examples in the preliminary in- 
structions. He then proceeds through the 
inventory placing a checkmark in the box 
which most clearly represents his reaction to 
the particular item under consideration. 

The score for a given scale area, ie. a 
particular nation, is defined as the number 
of blank spaces up to the checkmark when 
counting from that area to the checkmark. 
For Example 2 above, Brazil would receive 
“o” points, and Egypt “4.” The score for a 
given scale item equals the number of blank 
spaces up to the checkmark. Therefore, for 
each scale item on the inventory, each of the 
two areas receives a score. The score for any 
scale item always totals four points (0-+-4; 
1+2; 2+2; 3+1; 4+0). We assume that 
the points between the response boxes are 
equidistant on each scale. The initial scoring 


Example 3 
Item 
Brazil-Egypt 
(A) Actual score (Brazil, a; Egypt, b) 
(B) Estimated score for Brazil (c) 
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was done by hand, but has been adapted for 
IBM equipment. 

The inventory takes approximately eight 
minutes to administer. Less than 1 per cent 
of the subjects omitted items on the inventory. 

Five hundred undergraduate students in a 
beginning course in psychology took these 
inventories during the fall of 1950. The tests 
were administered at two-week intervals. 


Possible effects due to the order of adminis- 
tration were accounted for by presenting the 
three inventories in six different sequences. 
Early in 1951 a smaller group of fifty students 
took a test and retest battery consisting of 
two of the inventories. The retest inventories 
were pied forms of the originals. 


RESULTS 


Reliabilities. Two methods for the study 
of equivalence are used to determine the 
reliabilities of this instrument. A third meas- 
ure for the study of stability is also used. 

Equivalence as homogeneity. In_ this 
analysis we are attempting to answer the 
question, “How is each item related to the 
total inventory?” The total actual score on 
a given comparison item for the subjects 
tested (for example, Brazil-Egypt) is corre- 
lated in a product-moment manner with the 
sum of actual scores on all of the other eight 
comparison items for the seme subjects. 

Table 1 summarizes the product-moment 
correlations for this measure of homogeneity 
for the subjects tested. There is no reliability 
coefficient for the entire inventory, but one 
for each of the ten scales. Reliability coeffi- 
cients range from .74 to .99 with the majority 
exceeding .go. The second column on the 
table represents a computation of the relia- 
bility coefficients for the hostility inventory 
when the scores are converted into loga- 
rithms. The logarithmic scores account for 


Operations 


Computed by method given in Example 2. 
Computed by summing actual scores of Brazil with all other 


nations except Egypt. 


(C) Estimated score for Egypt (d) 


Computed by summing actual scores of Egypt with all other 


nations except Brazil. 


(D) Homogeneity r for the Brazil scale 


Computed by correlating a:b with c:d for each paired compari- 


son in which Brazil appears, i.¢., the nine nations with which 
Brazil was paired. 
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the use of ratios in calculating the product- 
moment correlations. Computing the relia- 
bility coefhcients by means of logarithms 
tends to increase the size of the coefficient. 

Equivalence as represented by a. The 
formula for a which is used is Cronbach’s re- 
vision of the Kuder-Richardson formula (1). 
The question asked here is, “Do scores rep- 
resent individual differences or are they 
primarily a cultural norm?” A value for a 
exceeding .60 is taken as representing indi- 
vidual differences. Table 1 presents the « 
coefficients for each of the subtests on each of 
the inventories. The a coefficients range 
from .59 to 89. 
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disregards the specific nations. We are ask- 
ing, “How is the rank of a given variable 
related to the score on that variable for the 
subjects under consideration?” 

Each curve in this pattern analysis has been 
determined by summing the scores for all Ss 
for the rank order (one through ten) under 
consideration, regardless of the nations which 
achieved each particular rank. 

In Fig. 1, H represents the plotting of the 
score of hostility on the rank order of hos- 
tility. The pattern is curvilinear. This pat- 
tern justifies our using a scoring as well as 
a ranking procedure. 

K represents the relationship between the 


TABLE 1 


Reviapitiry CorrFicieNts Fork THE Hostitity, KNowLEDGE, AND Lik1nGc INVENTORIES 


(H represents the hostility inventory, K the knowledge inventory, and L the social distance inventory) 








EQuIVALENCE 





HOMOGENEITY 





H log H K 


STABILITY 








Brazil -93 -98 
Egypt -94 .98 
England -96 -98 
France -93 .98 
Germany -94 
India -95 
Japan -9! 
Nigeria .96 
Norway -96 
U.S.S.R. .89 





Reliability as stability. Stability was tested 
for two of the inventories in the test-retest 
manner described in the previous section. 
The question answered by the stability test 
is, “To what degree have subjects developed 
a response set?” Pearson product-moment 
correlations are determined from individuals’ 
scores on the test and retest. The reliability 
coefficients range from .56 to .gt (Table 1). 

The Ss under specific consideration in 
analysis of the data are 50 persons chosen on 
the basis of having a perfect rank order on 
the social distance (“liking”) scale. The 
reason for this choice will be apparent in the 
succeeding paragraphs. Studies have been 
made of seven, ten, twenty, and fifty subjects 
without any apparent differences in the 
graphs for the expression of these attitudes. 

Pattern analysis. The pattern analysis rep- 
resented in the first three curves in Fig. 1 








4 6 8 10 
RANK 
Fic. 1. Tue Score or Each VARIABLE IN RELATION 


TO THE RANK ORDER ON THAT VARIABLE: Hos- 
Tizity, H; KNowrence K; Lixine, L 
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rank and the score on the knowledge inven- 
tory. The relationship here is linear. 

L represents the relationship between the 
rank and score of the social distance inven- 
tory. This relationship is also linear. 

Nation analysis. In Fig. 2, Graph H rep- 
resents the reflected curve of the relationship 
between the score of hostility and the various 


L 














OEFBIYNGJU EFGJOUBIYN NEFBGIYNIU 
RANK 


Fic. 2. THe Score or Eacn Nation on Eacu 
VARIABLE IN RELATION TO THE RANK ORDER OF 
THE NATION FoR THAT VARIABLE: Hosti1- 
iry, H (Rervectrep); KNow.epce, K 
(Rertectep); Lrxine, L 


(Brazil, B; Egypt, Y; England, E; France, F; Ger- 
many, G; India, 1; Japan, J; Nigeria, N; Norway, O; 
Soviet Russia, U.) 


nations in their rank order for the subjects 
tested. The graph for the expression of hos- 
tility is curvilinear. 

Graph K indicates the relationship between 
the rank of nations and the reflected total 
score for the subjects on the knowledge in- 
ventory. The curve is ogival and reflected. 

Graph L presents the relationship between 
the rank of nations and the total score for the 
subjects on the social distance inventory. 
These points might be fitted either with an 
ogive or a linear curve. 

The relationship pattern analysis. Again 
we disregard the specific nation and refer to 
the pattern of behavior under consideration. 
Figure 3 indicates the Ss’ reflected hostility 
scores based upon the rank order of social 
distance. (This is why Ss with perfect 
“liking” rank orders are chosen for this 
analysis.) For each S we compute the rank 
order of social distance for the countries 
under consideration. We then find for each 
of the rank orders the respective hostility 
score on that country. The hostility scores 
for the particular rank orders of the fifty Ss 
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2 4 6 - 10 
RANK “LIKING” 


Fic. 3. THe Score or Hostititry anp KNOWLEDGE 
IN RELATION TO THE RANK Orper oF LIKING: 
Hostitiry, H (Rercecrep); KNow.epce, 

K (Reriectep); Lixine, L 


are summed. This relationship portrays a 
discontinuous curve or two separate straight 
lines. 

Figure 3 also represents the reflected total 
scores on knowledge of the cultures for the 
Ss with reference to the rank order of nations 
according to their social distance. The curve 
tends to be parabolic. 

Figure 3 indicates the relationship between 
the total hostility and knowledge scores on 
the basis of the rank order of social distance 
of the nations. Hostility and knowledge vary 
inversely to the point of least knowledge; 
from the point of least knowledge onward, 
hostility and knowledge vary directly, with 
hostility accelerating more rapidly than 
formerly. 

Nation relationship analysis. The rank 
order on each variable for the ten nations 
under consideration is plotted on a graph of 
triangular coordinates. Three patterns are 
evident in Fig. 4: (a) Norway, England, 
and France; (4) Brazil, India, Egypt, and 
Nigeria; (c) Germany, Japan, and the Soviet 
Union. 


Discusston 


The method used in this study has been 
found fruitful for obtaining measures both 
of intensity and direction of feeling. Any 
one variable may be removed from the total 
inventory and scores obtained for the nine 
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Fic. 4. THe RANK Orper oF EacH NatTIOoN ON 
Each VagiaBLe PLoTTED ON TRIANGULAR Co- 
ORDINATES: Hostitiry, H (REFLECTED); 
Know tence, K; Lixinc, L 


(Brazil, B; Egypt, Y; England, E; France, F; Ger- 
many, G; India, I; Japan, J; Nigeria, N; Norway, O; 
Soviet Russia, U. Tie different types of lines represent 
different clusters of nations.) 


remaining variables without appreciably 
affecting the results. The inventory is brief, 
easily administered to groups, and easily 
scored. 

The reliabilities of these inventories are 
very high. The homogeneity test indicates 
that any given scale is very highly related to 
the total inventory. The a test indicates that 
scores represent individual differences rather 
than a cultural norm. The stability study 
indicates that the inventory elicits relatively 
stable responses, but that there is some vari- 
ation over a period of time. We should, 
therefore, be able to study changes brought 
about in the time intervening between two 
test administrations. 

The curvilinear relationship between the 
rank and the score on the hostility inventory 
is probably a crucial reason for the confusion 
found in some earlier studies. The rank- 
order method works well for the amount of 
knowledge and social distance inventories, 
but the hostility variable requires a more 
refined procedure. 

The curvilinear relationship between the 
hostility rank and score is replicated in the 
study of nations. However, both the knowl- 
edge and liking variables approximate ogive 
curves. This ogive relationship tends to 
reflect the adequacy of the criteria used in 
the selection of nations for this study. 


Of greatest aid in understanding the rela- 
tionships among the variables studied is the 
use of triangular coordinate graphs. The 
irregularities in these relationships are due 
to the discrepancy in the amount of knowl- 
edge about Norway and the high degree of 
liking Germany. Except for these two vari- 
ations, the entire figure would consist of 
isoceles triangles. The three patterns of rela- 
tionship evident in Fig. 4 provide a validity 
check of a rational nature on the generaliza- 
tions suggested in the following paragraphs. 

The data may be summarized in one gen- 
eral statement and corollary. 

1. Given the rank order of social distance 
from the self or ingroup to others and the 
amount of knowledge and information felt 
to be known about each variable, Aostility 
varies inversely with knowledge to the point 
of least knowledge and directly with knowl- 
edge thereafter. 

(a) The point at which hostility will have 
the greatest probability of being expressed, 
i.c., the scapegoat point, will be found in the 
region where the curves of knowledge and 
hostility change direction. 

In light of these data the following ques- 
tions are of great importance to the applied 
problems of hostility. What do we change 
when we say that we have “changed hos- 
tility’? Do we change the target or the 
pattern? If so, how and for how long? 


SUMMARY 


Three different attitude inventories were 
administered at two-week intervals to 500 
college students in the fall of 1950. They 
were administered in six different orders to 
account for practice effect. 

Each inventory elicited responses indicat- 
ing the intensity of the S’s choice between 
paired variables. Each inventory required 
eight minutes to answer. 

Reliability coefficients reflecting two meas- 
ures of equivalence and one of stability were 
computed. The inventories were found to 
be highly reliable with regard to these criteria. 

Relating rank to score, knowledge and 
social distance are linear, while hostility is 
curvilinear. 

Given the rank order of social distance, 
hostility and knowledge vary inversely to the 
point of least knowledge; from the point of 
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least knowledge onward, hostility and knowl- 
edge vary directly with hostility accelerating 
more rapidly than formerly. 
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EFFECTS OF PERSONAL, IMPERSONAL, AND PHYSICAL STRESS UPON 
COGNITIVE BEHAVIOR IN A CARD SORTING PROBLEM 


BRUCE M. ROSS, JOHN W. RUPEL, anp DAVID A. GRANT? 


University of Wisconsin 


His study investigates the differential 

effects on abstract behavior produced by 

administering the Wisconsin Card Sort- 
ing Test (WCST) under eight combinations 
of personal, impersonal, and physical stress. 
The WCST has been described earlier (1, 4, 
5, 6, 7) as a technique for investigating ab- 
stract behavior, and has been used to demon- 
strate clear-cut differences between schizo- 
phrenic patients and normal controls in this 
respect (3). 

Distraction and other stresses presumably 
affect complex. behavior (2, 11, 15), but little 
has been done to analyze what sorts of stress 
are effective. Also little is known about the 
aspects of cognitive behavior which are mest 
vulnerable to distraction and emotional stress 

2, 8, 9, 11). Since the scoring protocol of 
the WCST permits analysis of sorting errors 


into perseverative and nonperseverative clas- 
sifications, it is possible to investigate one im- 
portant thesis advanced by Hamilton (10, 16) 
and by Maier (12, 12) that stress and frustra- 
tion increase perseverative tendencies, even to 


“ 


the point of leading to a “persistent non- 


adjustive reaction.” 


Description oF TEST 


The WCST consists of four stimulus cards and 64 
response cards, which were devised so that each card 
contained from one to four identical figures of a 
single color. Four kinds of figures were used: stars, 
crosses, triangles and circles. Four different colors 
were used: red, yellow, green, and blue. A single 
card might have four red stars, or two green circles, 
or any of the 64 possible combinations of colors, 
forms, and numbers. Each card could then be 
sorted or categorized according to the color, the 
form, or the number of the figure. A sorting tray 
consisting of four double compartments was used. 
The four stimulus cards put before S in the upper 
set of compartments from left to right were: one 
red triangle, two green stars, three yellow crosses, 
and four blue circles. 

The S was given the pack of response cards and 


1 The experiment was planned jointly by all three 
authors; the laboratory work and statistical computations 
were carried out by Ross and Rupel; the general analy- 
sis, interpretation, and responsibility for the final manu- 
script devolved upon Grant and Ross. 
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was instructed in the following manner: “I want 
you to put these cards into four groups under- 
neath the ones lying on the tray. For each card, 
I will tell you whether you are ‘right’ or ‘wrong.’ ” 

The initial “correct” sorting stage was arbi- 
trarily determined in advance to be color. As S§ 
sorted the response cards he was informed on each 
card whether he was: “right” or “wrong.” As soon 
as S sorted ten consecutive cards correctly, E 
shifted, with no explanation to S, and began to 
call the form classifications “right” and all others, 
including color, “wrong.” In this way, the “cor- 
rect” classification or concept was later shifted from 
form to number, then back to color, followed again 
by form and number stages in the same order as 
before. (The S’s only cue to the shifts was in E’s 
“rights” or “wrongs.”) The 64 response cards were 
always presented in a standardized order so that no 
like color, form, or number followed each other. 
The experiment was considered satisfactorily com- 
pleted when the S had finished the six stages or 
when an experimental “failure” occurred, which 
meant S had sorted four decks of 64 response cards 
without completing stage six. 

The Ss were men and women elementary psy- 
chology students at the University of Wisconsin. 
Twenty different Ss satisfactorily completed the 
test for each of the eight conditions. In addition, 
there was a total of 21 “failures” to sort all stages 
correctly. The “failures” were distributed appar- 
ently at random, over all experimental conditions, 
and the results obtained from these “failures” were 
discarded from the present analysis. 


PRocEDURE AND EXPERIMENTAL DesIcNn 


The eight conditions of distraction involved: 
verbal “heckling,” electric shock, and a recorded 
auditory distraction. Each of these conditions was 
administered separately and also in all possible 
combinations with the remaining distraction fac- 
tors. In addition, a control condition was run 
in which the standard procedure was followed with 
no distraction. 

Under “shock” procedure, two electric shocks 
were administered to the wrist of S’s nondominant 
hand during the first and third stages, being given 
after the seventh consecutive correctly sorted card 
of the first (color) stage and after the third con- 
secutive correctly sorted card of the third (num- 
ber) stage. Shock was thus not a cue to indicate a 
shift in stages. The strength of the shock was 
adjusted before the experiment so as to be unpleas- 
ant but not painful. The presence of the electrodes 
constituted a continuous threat, as S never knew 
when to expect the shock. The personal “heck- 
ling” condition utilized a timed pretest that was in- 





Cocenitive BeHavior Unpver Various Types or Stress 547 


capable of solution. The pretest was formed by a 
judicious amalgamation of two separate O’Connor- 
type Wiggly-Block Tests, plus continuous, personal, 
verbal, deprecatory heckling by E during both pre- 
test and WCST administration. The auditory im- 
personal “distraction” condition was brought about 
by the playing of a potpourri of fragmentary radio 
program excerpts recorded at varying loudness levels 
on a wire recorder. 

The eight conditions of administration formed a 
tri-factorial design permitting the partitioning out 
of the three main effects, the three first-order inter- 
actions, and the second-order interaction in all the 
analyses of variance subsequently performed on the 
test scores. Since two Es administered the test, 
the decision was made to confound completely the 
second-order interaction effect with experimenter 
differences (14, pp. 102-108), thus eliminating this 
source of variation from the error estimate. 


RESULTS 


All responses for each test were recorded 
and scored on the basis of the following clas- 
sification.” 

a. Perseverative Errors. Cards sorted on 
the basis of the previously correct category, 
except for an initial uncounted perseverative 
error needed to inform S of the change in 
stage, were classed as perseverative errors. 

b. Nonperseverative Errors. Cards sorted 
incorrectly on some basis other than the previ- 
ously correct concept were classed as non- 
perseverative errors. 

c. Correct Responses. Only those correct 
sortings made after the uncounted persevera- 
tive errors and before the ten consecutive 
correct responses which evoked a stage shift 
were classed as correct responses.* 

The results are summarized in Table 1. In 
Table 1 are given the mean number of per- 
severative errors, nonperseverative errors, and 
correct responses per stage for all eight ex- 
perimental groups. Examination of Table 1 
reveals the general trend of the results 
throughout the six successive stages and the 
general effects of the independent variables 
on these scores. It is seen that the total cor- 
rect score rises following Stage 1 to a maxi- 
mum at Stages 2 and 3. Perseverative errors 
tend toward a maximum at Stage 3 but the 
number of nonperseverative errors drops fol- 
lowing the first stage, rises again on Stage 

2 The scoring system followed was identical to that 
used by Grant (4). 

3 The correct-responses score was the number of cues 
or reinforcements required to establish the new re- 
sponse. Thus a high number of correct responses would 
indicate slow or inefficient learning. 


3, and drops thereafter. In general, all scores 
show that the eight groups become very sim- 
ilar for the last three stages. The procedures 
which include electric shock seem generally to 
produce more errors and require more correct 
responses than the other four procedures. In 
general, the lowest error scores were achieved 
by Ss of the control group who were ad- 
ministered the WCST under standard con- 
ditions. 

In Table 2 the total scores from the last 
column of Table 1 have been subjected to 
analysis of variance and the effects of the 
primary variables and their interactions have 
been separately evaluated. It is seen that the 
electric shock is the only variable which pro- 
duces statistically significant effects on the 
total scores. Further, the significant changes 
occur only in the total correct responses and 
the nonperseverative errors. 

Since shock is the only variable with an 
over-all effect it is interesting to observe how 
shock affects the various error scores at each 
stage of the experiment. In Fig. 1 the total 
correct, perseverative errors, and nonpersever- 
ative errors have been plotted at each stage for 
the four shock groups and for the four non- 
shock or control groups—as is readily possible 
in factorial designs (14, pp. 89-93). In Fig. 
1 it is readily apparent that such differences 
as do occur appear in the first three stages. 
The chief effects of shock seem to appear 
in the total correct score at Stage 2, the 
perseverative errors at Stages 2 and 3, and 
the nonperseverative errors at Stages 1 and 2. 

Analyses of variance were carried out sep- 
arately for all scores at Stages 2 and 3 be- 
cause these stages seemed to contain most of 
the variation in scores. The analyses are 
summarized in Table 3. Again, it is seen that 
only the electric shock, alone or in com- 
bination, produces significant effects, and 
that the significant effects are limited to Stage 
2 of the experiment. It is noteworthy, in view 
of Hamilton’s observations (10, 16), that per- 
severative errors were significantly increased 
at Stage 2. 


Discussion 


The three principal findings of the present 
experiment can be summarized briefly as fol- 
lows: first, the physical threat of electric 
shock was the only stress condition which 
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TABLE 1 


Mean Perseverative Errors, NonperseveRATIVE ERRORS, AND Correct RESPONSES FOR THE EIGHT 
PRESENTATION CONDITIONS AS A FUNCTION OF Successive STAGES 








1. IMPERSONAL DISTRACTION PROCEDURE 





MEAN Score — - - 
1(C) 2(F) 3(N) 4(C) 5(F) I Tora. 





Perseverative Errors ve 3.00 6.60 3.00 -90 , 14.65 
Nonperseverative Errors , -go 3.95 2.00 -20 ° 10.90 
Correct ‘ . 80 4.10 3.10 -35 , 17.00 





Persona, HeEcKLING ProcEeDURE 





Perseverative Errors sea -90 1.20 .00 
Nonperseverative Errors ‘ ; -55 -50 -75 
Correct ‘ ‘ -05 1.45 2.15 





Exectric SHocK PROCEDURE 





Perseverative Errors ee : 8.10 3.10 1.80 
Nonperseverative Errors ‘ -0 4.40 1.30 1.40 
Correct : ; 6.50 3.05 3.05 





4. Distraction-SHock CoNnDITION 





Perseverative Errors saa ‘ . 1.90 
Nonperseverative Errors , . ° 1.10 
Correct ‘ 8. , 3.50 








Perseverative Errors nw ‘ 3.00 1.25 
Nonperseverative Errors j 5.50 1.35 1.05 
Correct ‘ ‘ 5.85 4.85 1.95 





SHock-Heckiinc CONDITION 





3.05 1.65 1.05 
Nonperseverative Errors ; ‘ 2.25 . 80 -75 
Correct 2.65 2.80 3.60 


2. 3 


Perseverative Errors 


7. Disrraction-Heckiinc-SHock ConDITION 


Perseverative Errors swale .85 5.15 ; -50 
Nonperseverative Errors 8 75 
Correct a . . -50 


8. Controt ConDITION 





Perseverative Errors sese .60 -f 1.75 ) 
Nonperseverative Errors 8 


Correct : . : 2.25 


1.7 
1.4 
3-35 


TABLE 2 
SUMMARY OF ANALYsES OF VARIANCE ON Totat Scores For: (A) Totat Correct Responses, 
(B) Perseverative Errors, (C) Nonperseverative Errors 


F's 


Source OF VARIATION A. Tor 3. PERSEVERA >. NONPERSEVERA 
TIVE Errors TIVE Errors 








1. Impersonal Distraction 2.91 0.10 

2. Personal Heckling ' 61 0.31 

3. Electric Shock -52 .38 6.04° 
4. Distraction and Heckling 0.00 
5. Distraction and Shock , . 0.82 
6. Heckling and Shock .0 . @.37 
7. Distraction, Heckling and Shock ). 0.35 
8. Error Mean Square ‘ 140.82 





* Significant at 5 per cent confidence lev 
Mean squares for error are given in italics at the foot of each column. 
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significantly affected the abstract behavior in- 
volved in the WCST; second, the effect on 
the test performance was general and cer- 
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tainly not confined to preseverative tenden- 
cies; and third, the effect on card sorting be- 
havior was transient as indicated by the lack 
of significant differences after the second 
stage. 

The implications of the present observa- 
tions are numerous. First of all, one is im- 
pressed with the relative imperviousness 
under stress of the cornplex behavior required 
of S by the WCST. Relatively severe per- 
sonal heckling and tantalizing fragmentary 
excerpts from radio programs had no sig- 
nificant effect upon the performance. Ap- 
parently, some conceptual activity is highly 
resistant to these forms of distraction in spite 
of the common-sense prediction that higher- 
order behavior would be especially vulner- 
able. Complex conceptual behavior may 
generally be less susceptible to degradation 
than simpler forms, although the data do 
not permit sc broad a generalization. A 
limiting consideration is the nature of the 
population investigated. The college sopho- 
more is probably fairly flexible and resistant 
to distraction. As a class, he is used to a 
background of radio stimulation and the 
deprecatory remarks of his peers. Electric 
shocks provide a direct physical threat, and 
here he does give way—temporarily. 

The generality of the reaction to the threat 
of shock is indicated by the increase in all 
scores: perseverative and nonperseverative 
errors, and total correct responses. Hamilton 
(10) and Patrick (16) commented on the ex- 
treme perseveration to be noted in response 
to a relatively simple problem where S was 
under direct physical stress. Some “persistent 
nonadjustive reactions” were doubtless pres- 
ent, but the picture presented by the three 
scores studied is one of general deterioration 
rather than the introduction of extraordinary 
perseverative tendencies alone. 

The rapidity of recovery from the adverse 
effects of shock is striking. This may be 
characteristic of the complex task or of the 
population tested or both. Although the 
shock was unpleasant, and the strong affec- 
tive response of inexperienced Ss to electric 
stimulation is well-known, the severity of the 
shock by no means precluded rapid habitua- 
tion. The present Ss showed such habituation 
a sense, attests to their essential 
Anxious Ss should show 


which, in 
“normality.” 
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greater persistence in their disorganized be- 
havior. It may turn out that the rapidity 
of recovery from stress-induced degradation 
may prove a more sensitive index to person- 
ality characteristics than the degree of the 
original disturbance. Expeziments are now 


being planned to follow up this conjecture. 


SUMMARY 


The effect of impersonal distraction, per- 
sonal “heckling,” and physical threat upon 
abstract behavior as defined by the Wisconsin 
Card Sorting Test (WCST) was studied in a 
tri-factorial design involving 160 college stu- 
dents as Ss. The WCST consists of response 
cards, each of which may be sorted for color, 
number, or form under four stimulus cards. 
The task requires S to sort according to color, 
and then, after ten correct responses, to shift 
to form, then to number and back through 
color, form, and number in six stages. No 
cues are given to shifts except that each card 
sorted is called “right” or “wrong.” ‘The 
WCST provides three scores: (a) total cor- 
rect responses required to learn each new 
response; (4) the number of perseverative 
errors; and (c) the number of nonpersevera- 
tive errors. These scores are available for 
each of the six stages. The principal findings 
were as follows: 

1. Electric shock, the physical threat, alone 
or in combination with other factors, was the 
only variable which degraded performance 
on the WCST to a statistically significant 
extent as revealed by the analysis of variance. 

2. The deterioration of performance which 
did take place was general in that all the 
three objective scores were affected. 

3. The Ss seemed to recover rapidly from 
the effects of shock, because no significant 
effects were found at Stages 3, 4, 5, and 6. 

It was pointed out that the relative im- 
perviousness to extrinsic threat exhibited by 
the abstract behavior was remarkable. The 
fact that nonperseverative error scores and 
total correct response scores showed more 
deterioration than did perseverative errors in- 
dicated that the degradation of behavior was 
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general and provided little or no indication 
that G. V. Hamilton’s “persistent nonad- 
justive reaction” was present. 
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HAT the personality of the perceiver 

determines to a large extent what is 

perceived is an hypothesis which has 
attracted the attention of many students of 
human behavior. Witkin, on the basis of his 
recent research in perception, has concluded 
that perception must be considered in a 
“broadened context, that is, as a process that 
can be understood only in relation to the 
over-all psychological organization of the per- 
son concerned” (7, p. 145). Although stated 
in different terms, Klein and his associates 
are apparently expressing the same idea in 
their statement that “more must be known 
regarding the  interrelationships between 
valuer, value and the stimulus situation” (4, 
p. 1). They have suggested that the indi- 
vidual himself determines his own perceptual 
behavior and hence, assert that it is of crucial! 
importance to “seek to understand the place 
that perception holds in the person’s adaptive 
economy” (5, p. 32). The present series of 
experiments was designed to demonstrate 
experimentally such a relationship between 
an individual’s general personality structure 
and his perceptual response or process. 

The general point of view which has been 
adopted here regarding the perceptual process 
is that what an individual perceives is gov- 
erned largely by the body of assumptions 
which he has learned in the past. Stated in 
nother way, it might be said that percep- 
tion would not be what it is except for these 
learned assumptions—the term assumption 


being defined broadly enough to include 


the individual’s expectancies and purposes. 
Thus, it is suggested that if one is to approach 
any conclusive understanding of the percep- 
tual process, one must be concerned with 
wal dissertation 

studies were 

, n 

Th 


yuthor 
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ssistance in the carrying out of this 


considerably more than the anatomical and 
physiological characteristics of the perceiver 
and the physical dimensions of the stimulus 
and environment. Rather one must be 
directly concerned with what Witkin re- 
ferred to as the individual's “over-all psy- 
chological organization” and with what 
Klein has called the person’s own “adaptive 
economy. 

Integral to the interpretation of the per- 
ceptual process which is being suggested here 
is the notion that the act of perceiving is, in 
Dewey’s and Bentley’s terms, transactional. 
In their own words, they have said that 
“observation of this general (transactional) 
type sees man-in-action not as something radi- 
cally set over against an environing world, 
nor yet as merely action ‘in’ a world, but as 
action of and by the world in which the man 
belongs as an integral constituent” (2, p. 228). 
They further suggest that all of an individ- 
ual’s behavior, including “his most advanced 
knowings,” is considered as activity “not of 
himself alone, nor even as primarily his, but 
as processes of the full situation of organism- 
environment” (3, p. 506). Referring to the 
perceptual response specifically, two conclu- 
sions might be drawn. First, the perceptual 
experience should be considered as an active 
rather than a static process. Second, the act 
of perceiving should be considered as a proc- 
ess involving not simply an awareness of 
those aspects of the environment relevant to 
achieving the goal of the moment, but rather 
as a process in which particular stimuli 
acquire meaning because of the fact that we 
know from our past experience that stimuli 
have the possibility of various kinds of sig- 
nificance, and then we perceive in the stimu- 
lus the particular significance which has been 
consistently related to the achievement or 
denial of our purposes(1). Thus the per- 
ceptual experience of the moment represents 
what we have termed an “individual-en- 
vironment transaction” involving primarily 
what the individual perceiver brings to the 
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perceptual act and secondarily, the objective, 
physical aspects of the situation. 

In short, it might be said that an individual 
perceives in a particular way because of his 
past experience and in relation to his goals 
and expectancies. It would seem, therefore, 
that an understanding of the perceptual 
process can be approximated only if we take 
into account the events which have occurred 
in the past and which are anticipated in the 
future as well as those of the moment. In 
effect, the perceptual act as it occurs in the 
“now” is meaningless unless we consider as 
part of it the prior experiences and expec- 
tancies for the future of the observer. 

To verify this approach unequivocally in a 
pair of laboratory investigations would, of 
course, be impossible. The studies reported 
in the following pages, however, seem to the 
author to contribution to the 
beginnings of experimental evidence for the 
underlying postulates of the conception of the 
perceptual process suggested here. 

The specific problem with which we have 
attempted to deal is that of demonstrating the 
effect which the individual’s general level of 
personal security might have upon his per- 
ception of the distance of objects in the 
environment. It was hypothesized that in- 
secure observers would tend to see objects 
differently in terms of distance than do rela- 
tively secure individuals. 

It was further suggested that a more 
marked perceptual difference would occur 
when the objects to be localized symbolized 
values thought to be more critical to the 
generally insecure observer. As the results 
of both experiments indicate, such proved 
not to be the case. 

Two main studies were conducted in the 
investigation of this hypothesis. In each 
experiment, the observers were tested for 
general level of personal security by means 
of the Knutson Personal Security Inven- 
tory (6). The second general phase of both 
experiments consisted of each observer's 
making a series of distance judgments on 
each of a number of stimulus objects pre- 
sented to him under conditions which will 
be described below. 

The security questionnaire includes a face 
data section and 41 multiple-choice and open- 
end type questions. The items are designed 


constitute a 
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to tap ten areas of human experience or striv- 
ing: (1) subjective economic status, (2) cul- 
tural activities, (3) social status, (¢) mental 
and intellectual capacities, (5) sex, (6) looks 
and appearances, (7) belongingness, (8) 
health, (9) work satisfaction, and (10) re- 


ligion. The conceptual formulation of per- 


sonal security which Knutson brought to his 
work on the problem of personal security 
and which he has attempted to translate into 
his inventory items has been stated by him 


in the following way: 


Personal security is a dynamic concept. People 
seek security within their various areas of striving: 
by achieving it, they become secure in some areas; 
by failing to achieve it, they become insecure in 
other areas. The areas of personal striving are 
interrelated and overlapping, and some may be of 
greater importance than others to the security of 
the individual. As a result, feelings of security or 
insecurity may tend to become generalized through- 
out the personality structure, or they may be pro- 
jected from one area of striving into others. 
Furthermore, the biological structure or physical 
or social environment of an individual, by limiting 
or influencing the character and direction of his 
motivations, may have a direct bearing upon the 
nature of his securities. In view of these conditions 
or possibilities, there appears to be a general level 
or status of security as well as levels of security 
within the individual areas of striving. A person's 
feeling of security or insecurity within any area of 
striving involves his own subjective evaluation of 
his success, satisfaction and surety or confidence 
with respect to the carrying out of his purposes in 
past and present situations and group relationships; 
also with respect to his expectations, hopes, fears 
or uncertainties about the carrying out of his pur- 
poses and aspirations in future situations and group 
relationships. Feelings of security within any func- 
tional area vary in a continuum, so that security 
status within any area of striving or within any 
situation may be considered a matter of degree 
rather than a matter of absolute (6, p. 22-23). 


Both studies were designed so that the 
experimental situation and the task to be 
performed in it were relatively unstructured 
and ambiguous. It was felt that should the 
hypothesized relationship between person- 
ality and perception exist, it would be more 
clearly evident in an experiment so designed. 
It was intended that the observer be placed 
in a situation which would force greater 
dependence on his part upon the particular 
personality structure which he brought to the 
given task. A more detailed account of the 
required task and the apparatus employed 
is included below. In general, however, the 
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following conditions were adhered to: (a) 
the observers were told nothing about the 
real purpose of the experiment, (4) the sub- 
jects were not permitted to see either the 
laboratory or the apparatus, (c) the observers’ 
visual field, during the course of the percep- 
tual phase of the experiment, was limited to 
the test and comparison objects, and (d) the 
comparison object was of neutral configura- 
tion and unknown size. 


EXPERIMENT I 


Apparatus and Materials 


Viewing apparatus. The viewing apparatus, a 
modification of one of Adelbert Ames’s demonstra- 
tions (the “thereness-thatness” demonstration), con- 
sisted of a viewing cubicle in which the subject was 
seated and to the front of which was attached an 
alleyway 15 feet long, 4 feet wide, and 31 inches 
high. Mounted in the center of the alley and 
extending its full length was a section of track, 
calibrated in inches, on which rode an illuminated, 
black-and-white checkerboard pointer (the com- 
parison object). The pointer was eight inches 
square. A Clason projector was mounted on top 
of the viewing cubicle. The projection screen was 


placed on a level with and at a distance of 12 feet, 


8 inches from the projector. The screen was 
actually outside of the observer's field of vision, but 
by means of (a) a 45° mirror set on a level with 
the screen and on the front edge of the top of the 
viewing cubicle, and (b) an optical device mounted 
on the outside of the viewing aperture in the booth, 
the image was brought within the field of vision 
of the observer. The subject, while seated in the 
viewing cubicle, looked through this box-like optical 
device which consisted essentially of a cardboard 
cutout and a half-silvered mirror set at a 45° 
angle. The net effect of this device was twofold: 
(a) it allowed the observer to view the experi- 
mental object monocularly (the left eye being 
blocked) while (b) permitting him at the same time 
to view his indicator or pointer binocularly. By so 
designing the apparatus and lightproofing the lab- 
oratory room, what are usually cited as cues for 
depth perception were virtually eliminated as far as 
localizing the experimental objects was concerned. 
At the same time, the pointer was seen binocularly 
and hence localized reasonably well. 

Stimulus Twenty-five stimulus objects 
were selected, photographed and mounted in slides 
one by inches in size. Twenty-two of the 

black and white while three were 
Included in the group were such items as 
Sex, Kike, Boss, AFL-CIO, and several 
slides picturing geometric designs, an expensive car, 
a ten dollar bill, and a laborer. (See Table 1 for a 
complete list.) In each case, the objects were either 
colored or white against a black background. 


objects. 


four 
objects were 
cok pred. 


the words: 


HastIncs 


Bausch & Lomb Ortho-Rater: The ortho-rater 
was employed to check on the normality of the 
observer's vision. 

Knutson Personal Security Inventory: (See above 
for a brief description). 


Observers 


The sixteen observers who participated in the 
experiment were randomly selected employees of 
The Educational Testing Service located at Prince- 
ton, New Jersey. Although the sample was ex- 
tremely small, on such parameters as age, level of 
education, and income, it represented a reasonably 
good cross section of the employees in this 
organization. 


Procedure 


Each subject participated in two experimental 
sessions. During the first session, three settings 
were made on each object while during the second, 
the observer made two more settings on all the 
stimulus objects and then filled out the Knutson 
Inventory. By way of introduction to the experi- 
ment, each subject was informed simply that he was 
about to help us by participating as an observer in 
an experiment in visual perception. It was ex- 
plained further that since we wanted him to be as 
naive as possible, it would be necessary to prevent 
him from seeing the apparatus and the laboratory 
until after he had finished making his observations. 
Each person was then blindfolded at some distance 
from the laboratory room and led to it and the 
viewing cubicle. Once here, the blindfold was 
removed, the subject seated comfortably, and those 
parts of the equipment with which he would be 
concerned—namely, the viewing aperture and the 
crank handle with which he was to move the 
pointer—were explained to him. The instructions 
given were purposely very brief. The observer was 
told that he was about to see through the aperture 
a series of objects, one after the other, and a black 
and white checkerboard pointer. He was informed 
that his task was to move the pointer (in the depth 
dimension, i.e., back and forth) until in his judg- 
ment it appeared to be directly beneath the object. 
The objects were then presented to the subject in 
random order and settings were made and recorded. 
The starting position of the pointer was consistently 
at that point on the alleyway nearest the observer. 


2 To ascertain the effect of the starting position of the 
pointer, a pretest was made in which the indicator was 
started randomly at both the near and far ends of the 
range. It was found that the starting position of the 
pointer governed only the location of that segment of 
the total range within which most observers made most 
of their settings. In other words, when the pointer was 
located at the near starting position, the stimuli were 
judged by most to be in the near half of the range and 
Those subjects who tended to see the objects 
being nearer did so no matter which segment of the 
range was being used. \n view of this, it was felt that 
it would be unnecessary to start the pointer at randomly 
selected points. Hence, at all trials for all observers the 
indicator was started at the near position. 


vice versa. 


as 
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While the experiment was in progress, the labora- 
tory was blacked out except for the light from the 
image itself and the illumination of the pointer, 
both of which were held constant for all subjects at 
all times. The walls, floor, ceiling, and apparatus 
were all black-opaqued. 

Upon completing the three trials in the first ses- 
sion of the experiment, the subject was blindfolded 
again and led out of the laboratory. No indication 
of results was given. The interim period between 
the two sessions averaged five days, and in no case 
was it less than three. As indicated previously, dur- 
ing the second session, the observer made two more 
settings on each object and completed his work by 
filling out the Knutson Inventory. 


REsuLTs 

The Knutson system of scoring the inven- 
tory was followed. His procedure involves 
assigning a value of one to the so-called 
“secure” alternative answer in each item of 
the questionnaire and a score of zero to the 
“insecure” and “don’t know” responses. 
Using this procedure, the maximum score 
which can be attained on the inventory is 41. 
The security scores of our population of 16 
ranged from 25 through 38. 

Using the mean of the settings made on 
each of the 25 objects, coefficients of corre- 
lation were derived between the mean set- 


tings on each stimulus object and the security 


scores. As indicated in Table 1, the rho’s 
ranged from +.50 through +.80. These 
coefficients indicate that the more insecure 
the person tended to be, the nearer he would 
localize the stimuli. Since the population 
was so small, it was advisable to use the 
rank order rather than the Pearson product- 
moment correlation method. Just as a mat- 
ter of interest, however, r’s were obtained for 
a few of the stimulus objects. In each 
instance, the magnitude of the correlation 
dropped slightly. For example, the correla- 
tions on the slide picturing the expensive car 
were +.80 (rank order) and +.77 (Pearson 
product-moment). 

The data were further analyzed by select- 
ing a random sample from the group of 25 
stimulus objects, and deriving F-ratios for 
them in order to test for significance of dif- 
ferences between the means. In each case 
the differences were found to be significant. 
In addition, a #-test for significance of the 
coefficients of correlation was applied to the 
data. It was found that a rho of +.50 or 
better was significant at the .05 level while a 
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rho of +.80 was significant at the .o1 level of 
confidence. 


TABLE 1 


COEFFICIENTS OF CORRELATION BETWEEN SECURITY 
Scores AND Megan Distance LocaLiZaTIONS 
FoR TWENTY-FIVE STIMULUS OBJECTS 
EMPLOYED IN ExpEeRIMENT I 








Siuwe TITLe 





Expensive car 
Scissors 

Laborer (full face) 
Screw driver 
Geometric design 
Laborer (hands) 
Lawyer (word) 
CIO-AFL (word) 


Freedom (word) 

Boss (word) 

Geometric design 
Executive’s desk and chair 
Store scene 

Typewriter 

Manager (word) 

Kitchen scene 


Ten dollar bill 
Strike (word) 
Negro naval officer 
College (word) 
Children (word) 
Bank (word) 
Plumber (word) 
Kike (word) 

Sex (word) 





ExpErIMENT II 


The basic hypothesis tested in the second experi- 
ment was the same as that investigated in the 
initial study. There were two general reasons for 
conducting the second study: (1) to check, using a 
larger population, the results obtained in Experi- 
ment I, and (2) to inquire into certain secondary 
questions which arose as a result of the particular 
problem studied and the procedure used in the first 
piece of work. 

Specifically, the questions which were under in- 
vestigation included the following: First, will results 
similar to those of the first experiment be obtained 
using a larger population if, from a large popula- 
tion, the high-scoring and low-scoring groups on 
the criterion measure (the Knutson Personal 
Security Inventory) are preselected as observers? 
Second, will use of the interview technique be 
valuable in terms of reinforcing or throwing addi- 
tional light upon the results obtained from the 
security questionnaire (the criterion measure)? 
Third, is there a significant relationship between the 
level of security of an individual and the amount 
of time it takes him to perform the required per- 
ceptual task? Fourth, (a) in terms of possible 
kinesthetic cues, does it make a difference if the 
observer is responsible for actually moving the 
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pointer from arting position to the place where 
} 


t? (b) What effect, if 


placing the indicator to the left of 


he sees the 


any, will and 


parallel 


having it 


with the stimulus object (as opposed to 
ippear beneath the object) have upon the 
task of localizing the objects? 
With 


ch ny 


the 
erimental design, pro 


these questions in mind, following 
in the apparatus, ex] 


and selection of subjects were carried out. 


Mistertal 
Materials 


respects, 
l The two 
first, so constructing the car 
rode that the pointer 
be to the lett of and 


stimulus 


» as described al 
pointer 
object, and 


that the 
the subject’s 


al de vice so 
center of 
of regard was parallel to 


cord 


fer was el iployed to r 
| ct in making each 


1 subj 
the main section of the 
Inventory were dropped 
ertinent to the undergrad 
all the observers were 


Re )? de r 
given to the open-end 

the semistructured interview which 
end of the 


was used to 
> answers 


experimental session. 
ber of stimulus objects used was reduced 
five to eight. They were all black and 
black background and pictured the 
(Image dimensions are included 


he nun 
* against a 
following objects: 


after each ol yect) 


1. Rorschach ink blot, (Card 


IV) 


orer 


8% x 92 niches 

(full face) 84, x9 inches 
automobile 9% x 2% inches 
7 x4% inches 

7x9'2 inches 
ABCD—(letters) 8*4 x2% inches 
SEX—( word) 8°4 x 2% inches 
inches 


KIKI 8% x2% 


Exy 
Pack of 


Playing card 


cigarettes 


(word) 


Forty 


as observers 


two university undergraduates participated 
This number was split about equally 
junior, and senior classes. 
vf an advanced course 


weeks before 


among 


the sof homore, 
All were meml 

Approximately 

> of the experiment got under way, 


in social 
the 


ers ¢ 


S1X 


(76 men) were asked to 
From this group of 


s the course 
Knut 


men, 


n Inventory 


out the 
venty-six per cent who scored highest 


saheee who scored lowest were 
Out of 

the over-all range spread from 11 through 35 with 

the low group ranging from 11 through 23 and the 

high group from 29 through 35. Every effort was 


prevent the members of these groups 


a pe ssible score of 38, 


expended to 


Pur K. 
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from perceiving any connection between taking the 
inventory and participating in the experiment at a 
later date. 


Procedure 


After the subjects had been selected, each ob- 
server participated singly in cne experimental ses 
sion lasting, on the average, one and one-quarter 
hours. As in the first experiment, a blindfold was 
placed on the individual at some distance from the 
laboratory and he was led to the viewing cubicle 
made. After 
being instructed briefly, the subject made one obser- 
vation on each of four practice objects. 


in which his observations were to be 


These w ere 
the words children and plumber and pictures of a 
typewriter and a five dollar bill. This practice 
period was included in order to familiarize the 
subject with his task and to allow a minimum of 
dark adaptation to take place. 

the ex- 
perimental objects were presented to the subject one 
after the other in the following order: (1) pack of 
(2) ABCD, (3) Rorschach ink blot— 
Card IV, (4) laborer, (5) sex, (6) expensive auto 
mobile, (7) tke, ard (8) playing card. Each 
observer completed a total of three trials on each 
experimental object. As opposed to the precedure 
followed in Experiment I, the pointer was moved 
by the eyperimenter at the instruction of the sub- 
ject. <A record of the time and the 
localization distance was taken. 


Upon completion of the practice trials, 


cigarettes, 


consumed 


After the subject had completed his observations, 
he was interviewed briefly. The interview was 
built around both specific and open-end questions 
designed to elicit introspection on the part of the 
subject regarding his performance in the experi- 
ment and certain aspects of his personality. 


REsULTs 


The answer to our first question, namely, 
whether results consistent with 
Experiment I would be obtained using a 
larger population, is indicated in Table 2. 
Those observers who, on the criterion 
measure, appear in general to be relatively 
insecure, tended to localize all objects nearer 
than did the secure group of subjects. Again 
it seems evident that those stimuli selected as 


those in 


possibly being symbols of critical areas of 
insecurity fail to elicit a perceptual perform- 
ance differing from that obtained with the 
so-called neutral objects. A small sample 
t-test for significance of difference between 
The 


levels of confidence for each stimulus object 


the means was applied to the data. 
are included in Table 2. Both the degree 
and direction of the difference between the 
two groups perceptually is perhaps more 
strikingly shown in the final item in Table 2. 
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As indicated earlier, it was hoped that the 
interview might provide some insight into 
what the observers were consciously experi- 
encing as they performed the perceptual task 
and also, certain questions asked in the inter- 
were designed to check some of the 
items appearing in the criterion measure. In 
tabulating the Knutson Inventory, those 
questions which elicited the greatest differ- 


view 


ence between the secure and insecure popu- 
lations were culled out and categorized. The 
same procedure was followed with the items 
included in the interview. 
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group was 176 seconds and that for the 


insecure group, 154 seconds. The ¢ ratio 
was only .27 indicating that the difference of 
two seconds might readily occur by chance. 

Finally, the two changes in apparatus de- 
sign introduced in the second study, namely, 
having the experimenter rather than the 
observer actually move the pointer, and 
changing the location of the pointer from 
beneath the stimulus object to a position at 
the left of it, appeared to have little effect in 
view of the consistency of results in experi- 
ments 1 and 2. 


TABLE 2 


FOR SECURE 
CONFIDENCE OF 


MEAN OBSERVATIONS 


aND INsEcuRE Groups oN Eacun STiMULUs OBJECT AND THE LEVEL OF 
DIFFERENCES 


BETWEEN THE MEANS 


Mean Distance AT WHICH LocaLizep 


CURE GROUP 


InsecuRE Group 


Con FIDENCE 


DIFFERENCE 
Limit 


is 








153 inches 


122 inches 


inches 


w 


Pack of « 119 
Laborer 
Rorschach blot 
ABCD 


Playing card 


igarettes 151 
157 129 
156 128 
150 121 
135 110 
166 145 
152 132 


2 ON 


“wn 


Automobile 


Kike 


All objects combined 153 


On the basis of both the questionnaire and 
the interview results, it would seem correct 
to conclude that the insecure observers were 
considerably more concerned with: 

1. their vocational potential, i.c., chances 
for success as they define it; 

2. their relations with the opposite sex and 
other people in general; 

3. their personal appearance and the status 
of their health; 

4. their capacity to 
goals meaningful to them; 

5. the degree of happiness which they 
enjoy at the moment. The reader is referred 
to Table 3 for samples of these data. 

With reference to the possibility that there 
might be a significant relationship between 
level of personal security and the amount of 
carry out the 


establish and attain 


time an individual needed to 
required task, it was found that the secure 
and insecure observers, as groups, did not 
differ significantly.* The mean for the secure 
The fact that it took the insecure obs 


al to make the “near” loca 


rvers just as 


essenti lizations as it 


NWN NW WN Ww 
M-WNNWN NNN WD 


™“ 


Some IMPLICATIONS OF THE RESEARCH 

The fact that those individuals who give 
evidence of being relatively insecure tend, 
when placed in an ambiguous perceptual 
situation, to see objects closer to them would 
appear to be the most important result ob- 
tained from the two experiments. Phrased 
in more general terms, these investigations 
have demonstrated somewhat convincingly 
that given an unstructured perceptual situ- 
ation, that is, one in which the usual tools 
for performing the task are either in conflict 
or virtually absent, the perceptual process will 
follow rather specific patterns for observers 
with varying personalities. Further, it might 
be inferred, at least, that the particular per- 
ceptual performance or process brought out 
in the experimental situation is not simply an 
isolated exception, rather it is most likely 
characteristic of the individual in the sense 


took the secure group to make “far” localizations might 
be interpreted as indicating greater timidity and inde- 
cision on the part of the insecure groups, factors some- 


times associated wtih insecurity. 
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TABLE 3 


SAMPLE ITEMS FROM THE Knutson Personal Security Interview AND THE PERSONAL INTERVIEW 











Success 


Knutson Inventory 


1. Are you planning now to do something on your own 
that you are sure will give you a feeling of success? 


Interview 


2. Do you think that your chances of achieving what 
you want most in life are good or not so good? 


Secure INSECURE 
(in percentages) 

19 

53 

25 


Yes 
No 
Don’t know 


62 
25 
10 


Good 
Not so good 


go 
10 


72 
28 


PERSONAL APPEARANCE AND HEALTH 


Knutson Inventory 
3. Do you think that your looks and appearances have 
tended to help you, or have tended to stand in the 
way of your getting ahead? 
4. As far as your health in general is concerned, does 
the future look very good, fair, or poor? 


Tended to help 
Tended to stand in way 
Don't know 

Very good 

Fair 

Poor 


SociaL RELATIONS 


Knutson Inventory 
5s. Would you say that your future in this area (that 
is getting along with members of the opposite sex) 
looks very good, fair or poor? 
6. Do you feel satisfied or dissatisfied in your relations 
with the opposite sex? 


7. Do you feel sure that people you go with accept you 
as a social equal, or do you wonder about that? 


Interview 
8. Do you think that your marriage is or will be all 
that you want it to be? 


Very good 

Fair 

Poor 

Satisfied 
Dissatisfied 
Don't know 
Feel sure 
Wonder about it 
Don't know 


Yes 
No 


HAPPINESS 


Knutson Inventory 
9. Does life tend to be very happy, fairly happy, some- 
what unhappy, or very unhappy for you? 


Interview 

When you stop to think about it, does it seem to you 
that, by and large, you are happier, not as happy, or 
equally happy as the people you go with? 


10. 


CONFIDENCE IN ESTABLISHING 


Knutson Inventory 
what you want most out of 


11. Are you sure you know 


life? 


Interview 


12. In your opinion, is your over-all level of aspiration 
higher, lower or about the same as the people with 
whom you associate most closely? 


Very happy 

Fairly happy 
Somewhat unhappy 
Very unhappy 


Happier 
Not as happy 
Equally happy 


AND MAINTAINING GOALS 
Yes 
No 
Don’t know 


43 
43 
14 


Higher 
Lower 
About the same 


48 23 


9 
68 


4 
48 





that it represents a way of acting which has 
in the past under similar circumstances 
proved to be of value to him in some way. 
In this connection, reference might be 
made again to the research findings of 
Klein (4) and Witkin (7). Klein found that 


certain subjects tended consistently to over- 


estimate or underestimate the size of his discs 
regardless of the nature of the stimulus 
objects or any single aspect in the experi- 
ential backgrounds of his three groups of 
subjects. In his opinion, it was the difficulty 
of the task which seemed to be related to this 
perceptual tendency. Witkin has also sug- 
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gested that specific ways of performing per- 
ceptually are related to specific personality 
types. 

Although to demonstrate experimentally 
that just as people vary in their personalities 
so do they vary in their perceptions is an 
important first step, it still remains to explain 
why a particular person tends to perceive in 
a particular way. In this instance, for 
example, why does the insecure observer tend 
to see all objects closer? What function does 
this particular mode of perceiving have for 
the particular individual? 

The answer to this question lies beyond the 
limits of these experiments, for it would be 
necessary to carry out a rather complete 
analysis of the individuals involved in order 
to demonstrate the function which the per- 
ceptual process has in any given instance. 
Hence, if we are to arrive at any answers to 
the question of the function of particular 
perceptual processes, it seems evident that 
future research must have clinical supple- 
mentation. Not only should there be an 


increasing adoption of some of the techniques 
used by clinicians and psychiatrists, but also 
their insights, growing out of their practical 
experience with individual patients, should 


be considered carefully. 

The study reported here obtained data 
only on distance judgments and the problems 
raised obviously need further experimenta- 
tion. Another research problem stemming 
from these investigations would be to study 
the effect of varying the degree of ambiguity 
in the perceptual situation. We have con- 
cluded that the presence of conflicting cues 
and the unstructuredness of the situation 
were integral to the results obtained. The 
logical next step would seem to be to increase 
and decrease the degree of ambiguity. If our 
conclusion is correct, it would be expected 
that the perceptual difference between the 
two groups of observers would tend to de- 
crease as the degree of ambiguity decreased. 


SUMMARY AND CONCLUSIONS 


Two separate investigations were under- 
taken in order to examine the hypothesis that 
a significant relationship exists between an 
individual’s personality and his perceptions. 
Specifically, it was asked whether a person’s 
general level of personal security influences 
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where he visually perceives objects in the 
environment when the situation is such that 
the usual cues are either absent or in conflict. 
As a corollary to this query, it was suggested 
that there might be a relationship between 
an individual's level of personal security and 
where he perceives particular stimuli pre- 
selected as symbols of certain areas of human 
striving. 

In both studies an adaptation of one of 
Adelbert Ames’s demonstrations (the “there- 
ness-thatness” demonstration) was employed 
as the basic piece of equipment. The stimu- 
lus objects were presented to the observer as 
projected images or pictures. A measure of 
personal security was obtained on all ob- 
servers by means of the Knutson Personal 
Security Inventory. 

In the first study, 16 subjects were ran- 
domly selected and asked to make five dis- 
tance localizations on each of 25 stimulus 
objects. Following the perceptual phase of 
the experiment, the subject filled out the per- 
sonal security questionnaire. For each stimu- 
lus object, coefficients of correlation were 
derived between the subjects’ mean distance 
localizations and their security scores. The 
rho’s ranged from -+.50 through +.80 with 
no apparent difference between the so-called 
value symbols and the more neutral stimuli. 
The coefficients of correlation indicate that, 
in general, the more insecure an individual 
tended to be, the closer he tended to see all 
objects. 

The second investigation was undertaken 
in an effort to provide answers to the follow- 
ing questions. First, will results consistent 
with those of the first experiment be obtained 
if, from a large population, the high and low 
scoring groups on the criterion measure are 
preselected as observers? Second, will use 
of the interview technique be valuable in 
terms of reinforcing the results obtained from 
the pencil and paper questionnaire (the cri- 
terion measure)? ‘Third, is there a signifi- 
cant relationship between level of security 
and the amount of time it takes an individual 
to perform the given perceptual task? 

The Knutson Personal Security Inventory 
was administered to 76 university under- 
graduates, and from this population the 21 
who, scored highest and the 21 who scored 
lowest were selected as subjects. Each indi- 
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vidual was asked to make three observations 
on each of eight objects. The experimental 
session was terminated with an interview. 

As in the first study, those individuals who 
were relatively insecure tended, by and large, 
to see the stimulus objects as nearer. Fur- 
ther, there was no apparent difference be- 
tween the results elicited from the so-called 
value-symbol objects and those which were 
considered to be more neutral. 

The the 
indicated no significant difference between 
the amount of time taken by the secure and 
groups, to make the 
distance localizations. The information ob- 
tained in the interview proved, for the most 
part, to be consistent with that brought out 


results of second investigation 


insecure observers, as 


in the questionnaire. 

In conclusion, these experiments tend to 
confirm the two following propositions: 

1. In an unstructured perceptual situation, 
where individual stimuli de- 


an perceives 


pends to a large degree upon what he brings 
to the act of perceiving. In this instance, the 
evidence indicates that the more insecure a 
person is, the closer he tends to see objects. 


Pur K. 
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2. The objective content of the stimuli is 
not necessarily coercive for the perceptual 
process. It would appear rather to be the 
nature of the situation coupled with tne 
experiential background (personality and ex- 
pectancies) of the perceiver which govern his 
perceptual process. 
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THE EFFECT OF DIFFERENTIAL INSTRUCTIONS ON 
ANXIETY AND LEARNING 


SEYMOUR B. SARASON, GEORGE 


MANDLER, 


anp PEYTON G. CRAIGHILL 


Yale University 


N a previous publication (1) the writers 
presented experimental data on the dif- 
ferences in learning and performance 
in the 


I 


between two groups who differed 
amount of anxiety experienced in testing 
situations. An important question arising 
from these data concerned the effects of dif- 
ferent sets or instructions on subsequent 
learning. Experimental data on this ques- 
tion will be here presented. The data are 
based on two different studies using two dif- 
ferent samples of subjects. In both studies 
the high anxiety (HA) and low anxiety 
(LA) groups represent the extremes of a 
distribution obtained from the responses of 
college sophomores to the anxiety question- 
naire described in the first study. It should 
be emphasized here that the anxiety ques- 
tionnaire was specifically concerned with 
attitudes and experiences in testing situations. 
As in the first study the subjects were not 


aware of any connection between taking the 
questionnaire and being called back for indi- 
vidual testing. 

Theoretical considerations. The following 
theoretical framework is summarized from a 
more extended presentation in a previous 
article (1). 


Anxiety will be considered as a learned drive 
with the characteristics of a strong stimulus. When 
anxiety has been learned as a response to situations 
involving intellectual achievement (e.g. test situa- 
tions), two types of responses will tend to be 
evoked: 

(a) Responses which are not task-relevant; self- 
centered feelings of inadequacy, attempts at leaving 
the situation, etc. 

(6) Task-relevant responses which reduce the 
anxiety by leading to completion of the task. These 
are identical with other task-relevant responses 
evoked by the testing situation. 


The following hypotheses are relevant to 
the present study: 

1. Individuals with a strong anxiety drive 
(HA subjects) will perform poorer than 
those with a low anxiety drive when the task 
irrelevant responses interfere with adequate 
performance. 
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2. When a stimulus situation contains 
elements which specifically arouse test or 
achievement anxiety, this increase in anxiety 
drive will lead to poorer performance in 
individuals who have task-irrelevant anxiety 
responses in their response repertory. For 
individuals without such response tendencies, 
these stimulus elements will raise their gen- 
eral drive level and result in improved per- 
formance. (Such elements would be any 
reference that the individual is being judged, 
a statement of expected performance, etc.) 


EXPERIMENT I 
Method 


At the beginning of the testing session the 
subject (S) was told that he was going to be 
given some intelligence tests as part of a study 
relating to the aptitude tests given to every 
Yale freshman. He was told that his scores 
in the session would be cormpared with his 
aptitude test scores to make a better inter- 
pretation of the latter possible. 

At this point the HA (N36) and LA 
(N=36) groups were each split in two: 
one-half of each group being told that they 
would be expected to finish in the time 
allotted to the test, and the other half being 
told that they were not expected to finish. 
The expected-to-finish group (ETF) was told 
that “the test is designed so that it should 
be fairly easy for the average college student 
to complete the test within the time limit. 
You should have little difficulty in finishing 
it, at least by the second or third time.” 
After the first and third trials this group was 
again told that they were expected to finish. 
The not-expected-to-finish group (NETF) 
was told that the test was constructed so that 
nobody could finish the test within the time 
limit. After the first and third trials they 
were reminded that they should not worry 
about finishing because nobody could finish 
it. (Actually no one could or did finish the 
task in the time allowed.) 

The test used was a modification of the 
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Wechsler-Bellevue Digit Symbol subtest, a 
50-second time limit being used for each of 
five trials. Five different forms of the test 
were devised, each form differing in the 
number of digit-symbol units on a line. This 
was done to make it difficult for the subject 
to determine if and to what extent he was 
improving from trial to trial. For each sub- 
ject the order in which the five different 
forms were given was randomized. 
Predictions 

The predictions given below are based on 
the theoretical considerations summarized 
above: 

Prediction I. In the HA group those who 
are given the ETF (expected to finish) in- 
structions will have poorer scores than those 
who are given the NETF (not expected to 
finish) instructions. This prediction is based 
on the assumption that the ETF instructions 
will increase the strength of the irrational or 
task-irrelevant anxiety responses while the 
NETF instructions will reduce the strength 
of these anxiety responses or maintain them 
at their initial level. 

Prediction Il. In the LA group those who 
receive the ETF instructions will have detter 
scores than those who are given the NETF 
instructions. This prediction is based on the 
assumption that the ETF instructions will, in 
this group, result in a higher drive level than 
will be the case with the NETF instructions. 
In the HA group the higher drive level 
strengthens the predominant task irrelevant 
r oonses, while in the LA _ group it 
strengthens the predominant task relevant 
responses. 


Results 


Figure 1 shows the learning curves for the 


four groups. Table 1 shows those differ- 
ences among the means of the four groups 
on trials 1 and 5 which were significant or 
approached significance. It is apparent from 
the table that on trial 1 the predictions hold 
up very well. It is also clear that on this trial 
both the HA groups do significantly poorer 
than the LA-ETF group although they do 
not differ significantly from the LA-NETF 
group. 

On the last trial the prediction concerning 
the LA groups is again confirmed: the LA- 
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43 
42 
4) 

40 
39 
38 
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LA-ETF O———0 
LA-NETF @-—=—-0 
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HA-NETF @----@ 








2 3 4 
TRIAL 
PERFORMANCE Curves OF For ExpeRIMEN- 
t Test IN 


Fic. 1. 
TaL SusGRouPps on Dicit Sym 
EXPERIMENT | 


NETF being significantly poorer than the 
LA-ETF group. The prediction concerning 
the HA groups is not supported. On trial 5, 
as on trial 1, the LA-ETF group is superior 
to all other groups. 

A question relevant to the present study 
concerns possible differences in variance 
among the four groups. A comparison of 
differences in variances among the four 
groups for each of the five trials did not re- 
sult in any significant findings. In order to 


TABLE | 


SIGNIFICANCE OF DIFFERENCE BETWEEN Four Sus- 
GROUPS ON TRIALS I AND 5 OF Dicir Symsot TEst 


TRIAL CoMPARISON 





HA-ETF vs. HA-NETF 
HA-ETF vs. LA-ETF 
HA-NETF vs. LA-ETF 
LA-NETF vs. LA-ETF 
HA-ETF vs. LA-ETF 


s 
5 LA-NETF vs. LA-ETF 
5 HA-NETF vs. LA-ETF 





* One tail of the distribution of ¢. 


Note: The group with the better score is in italics. 
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test the differences among the groups in vari- 
ance change from trial 1 to trial 5 the follow- 
ing statistical procedure was used. In each 
group the 18 subjects were assigned ran- 
domly to one of six groups. For each of these 
groups of three subjects, an estimate of vari- 
ance was obtained for trials 1 and 5. The 
variance on trial 5 was then subtracted from 
that on trial 1, resulting in six values which 
were then averaged. For each of the four 
major groups, therefore, a mean variance- 
change score was obtained which then served 
as the basis of the analysis of variance (Table 
2). The analysis of variance shows that the 
anxiety variable is not a factor in producing 
significant variance changes from trials 1 to 


TABLE 2 


ANALYsIS OF VARIANCE OF MEAN VARIANCE CHANGE 
FROM TRIAL I TO TRIAL 5 








MEAN CHANGE ScoRES FOR THE Four 
EXPERIMENTAL SUBGROUPS 
Low 

ANXIETY 


Hicu 
ANXIETY 





Expected-To-Finish (ETF) 37- 
Not-Expected-To-Finish (NETF) 3- 


7 
0 





ANALYSIS 


SouRCcE df Mean Squares F 





Total 23 
Error 20 
Anxiety I 1108 
Instructions I 5289 
Interaction I 256 


1516 





5 but that the instruction variable approaches 
significance. The near significant effect of 
the instruction variable appears largely due 
to the relatively small variance change in the 
LA-NETF group. The significant or near 
significant comparisons are as follows: 

t P 


1.95 03 


HA-ETF ; 
1.58 06 


LA-ETF 


.A-NETF vs. 
A-NETF vs. 


ExpErIMENT II 
Method 
In the first experiment the HA and LA 
groups represented the extremes of the dis- 
tribution based on the anxiety questionnaire: 
going in on one end to about the 11th and 
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on the other end to about the 89th percentile. 
In Experiment II the LA (N=12) group was 
picked from between the 11-15th percentiles 
of the same distribution while the HA group 
(N=12) was picked from between the 85- 
89th percentiles. One-half of each group re- 
ceived the identical introductory, “ego-involv- 
ing” (EI) instructions given to all subjects 
in Experiment I. The other half of each 
group was given nonego-involving (NEI) in- 
structions: they were not told that they were 
to be given an intelligence test which would 
be compared to their entrance test scores but 
instead were told that the examiner wanted 
to standardize some tasks and that “your own 
performance is only of importance insofar as 
it contributes to the total performance of a 
group of students.” All subjects were given a 
trial on a Kohs design, followed by a trial 
on a stylus maze. This procedure was fol- 
lowed until each subject had had 5 trials on 
each task. Only the results of the stylus maze 
will be reported here. The maze was iden- 
tical with that used by Pomeroy (2). Each 
subject received two scores on each trial: 
number of errors made by entering blind 
alleys and total time in seconds necessary for 
completion. 


Predictions 


As in Experiment I the instruction variable 
was expected to have opposite effects in the 
HA and LA groups. 

Prediction I: In the HA group those who 
receive the ego-involving (EI) instructions 
will do poorer than those who receive the 
nonego-involving (NEI) instructions. 

Prediction II: In the LA group those who 
receive the EI instructions will do detter than 
those who receive the NEI instructions. 


Results 


Figure 2 shows the learning curves, based 
on the time scores, for the four subgroups. 
Figure 3 shows the curves based on error 
scores. Table 3 shows those differences 
among the means which were significant. In 
regards to Prediction I the only trial in which 
it is conclusively borne out is trial 3 when 
time scores are used. 

Although the HA-EI group always gets 
poorer mean error scores than the HA-NEI 
group, and is poorer on trials 3, 4, and 5 for 
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GROUPS ON STyLus Maze 


time scores, the differences are only significant 


for trial 3. This is probably in part due 
to the large variability in each subgroup 
(N=—6). 


TABLE 3 


SIGNIFICANT #-VALUES FOR COMPARISONS OF EXPERI- 
MENTAL SUBGROUPS ON Sryut Ss Maz 


ERROR 
Scores 


TIME 


CoMPARISON : 
Scores 


TRIAL 





Foal yr 
HA-E] vs. LA-ElI .29 .002 
HA-EI vs. LA-NEI -88 .005 
HA-E]I vs. HA-NEI .08 
HA-EI vs. LA-El II 


HA-EI vs. LA-El 41 





*20 df, derived from estimate of error variance in 
analysis of variance. 
** One tail of the distribution of ¢. 
*** No significant difference. 
Note: The group with the better score is in italics. 


In none of the trials is Prediction II borne 
out. However, here again, the LA-EI group 
always dco. better than the LA-NEI group 
in both mean time and error scores. These 
findings based on mean time and error scores 
tend to support the two predictions. 

The most obvious finding in Table 3 con- 
cerns the significant differences between the 
HA-EI and LA-EI groups. While the NEI 
instructions do not appear to have a differ- 
ential effect on HA and LA groups, the El 
instructions do have such an effect. The 
anxiety reactions of a subject who is prone 
to such tendencies in a testing situation will 
not interfere with his performance unless he 
also feels ego-involved. 


Discussions AND CONCLUSIONS 


The following conclusions seem warranted 
from the two experiments: first, the anxiety 
questionnaire possesses an encouraging degree 
of validity, especially when the extremes of 
the distribution are compared with each 
other; second, stress-producing instructions 
can have opposite effects with different sub- 
jects, depending on the anxiety level in the 
testing situation. 

A question that might be asked about the 
findings in the studies so far reported con- 
cerns differences in intellectual ability be- 
tween the two anxiety groups. It might be 
argued that the high anxiety group is not as 
bright as the low anxiety group. To this pos- 
sible argument the following should be kept 
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in mind: (a) in the previously published 
study it was found that under certain speci- 
fied conditions the high anxiety group per- 
formed better than the low anxiety group, 
as was predicted; (b) differences between per- 
formance of the two groups tend to disap- 
pear as learning continues; (c) it is not clear 
how the assumption of “original” differences 
in ability can explain why the instruction 
variables had opposite effects in the two 
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anxiety groups. Studies bearing on this prob- 
lem are in progress. 
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ANXIETY-REDUCTION AS REINFORCEMENT IN MALADAPTIVE 
BEHAVIOR: EVIDENCE IN STUTTERERS’ REPRESENTATIONS 
OF THE MOMENT OF DIFFICULTY’ 


GEORGE J. WISCHNER 


University of Ulinois 


HIs paper generally is concerned with 


the nature and identification of the 

reinforcing mechanism (mechanisms) 
in maladaptive or non-integrative behavior 
with special reference to the problem of stut- 
tering. The present writer (13) elsewnere 
has presented a program of research designed 
as a systematic experimental approach to the 
problem of stuttering as learned behavior. 
A re-examination of data available in the 
literature together with the results of studies 
completed as part of the research program 
have emphasized the fruitfulness for our pur- 
poses of working within the general frame- 
work of reinforcement behavior theory (3, 
6, 8). 

The integration of symptomatic behavior 
such as stuttering within the structure of 
reinforcement learning theory raises as a 
central problem the nature of the reinforce- 
ment in such behavior. The problem in stut- 
tering behavior appears to be even more 
complex than is the case in other instances 
of symptomatic behavior. Stuttering may be 
viewed as one example of persistent mal- 
adaptive behavior with relatively immediate 
consequences that appear to be more punish- 
ing than rewarding. That the act of stut- 
tering is extremely unpleasant and carries 
relatively immediate social penalties with it 
is emphasized by practically all stutterers. 
As one stutterer remarked in a group meet- 
ing, “I feel like my insides are eaten out 
after I stutter.” 

The problems for reinforcement theory 
posed by persistent maladaptive or non-inte- 
grative behavior with relatively immediately 
unpleasant consequences have been consid- 
ered by Mowrer and Ullman (9) and more 
recently by Dollard and Miller (3). In this 
connection the latter write as follows: 

. when the painful after effects of symptoms 
are considerably delayed, they do not pose any prob- 

1 This article is adapted from a paper presented at the 


annual meeting of the American Psychological Associa- 
tion, Chicago, 1951. 


lem for reinforcement theory. Symptoms with an 
immediate painful effect pose a definite problem. 
Until detailed investigations of possible reinforce- 
ments have been made so that such symptoms are 
better understood, we will not know whether they 
fit our theory or are instructive exceptions that will 
force radical alterations and improvements in it 


(3 pp. 189-190). 

In a recent paper the present writer offered 
several hypotheses regarding the nature of the 
reinforcing mechanisms underlying the per- 
petuation of stuttering behavior. The present 
paper considers further one of these hypothe- 
ses. The hypothesis as originally stated is 
the following: 


It is proposed that the act of suttering may be 
specifically reinforced by virtue of its relatively 
close association with anxiety-tension reduction ac- 
companying the removal of a feared word. It is 
assumed that a feared word arouses a state of ex- 
pectancy (anxiety) and that the act of stuttering on 
the word is reinforced by the tension reduction 
accompanying the completion of the word on which 
difficulty is experienced. This hypothesis empha- 
sizes the possibility of a vicious cycle in stuttering 
behavior in which the completion of the stuttered 
act results in a reduction of the anxiety-tension 
evoked by the stimulus word, with consequent re- 
inforcement of the stuttering behavior (12, pp. 
330-331).? 


METHOD 


Data bearing on the hypothesis under con- 
sideration were gathered from 33 stutterers, 
30 males and 3 females, enrolled at two Uni- 
versity speech clinics. The majority were of 
college age and enrolled as university stu- 
dents. There was also a small number of 
elementary and high school age subjects. 
Exact ages were not given by all Ss. 


2In a psychoanalytic interpretation of stuttering, 
Coriat (2) makes the following interesting descriptive 
statement which bears on the present hypothesis: “If 
the motor accompaniments of the attempt to speak are 
carefully observed, the stammerer will be seen in the act 
of nursing at an illusory nipple, as shown by the sucking 
movements of the lips and tongue, the excessive flow of 
saliva, deep breathing, rapid heart beat, and yawning, 
all followed by a feeling of relaxation after the enuncia- 
tion of the difficult word.” Our interest is particularly 
in the portion of the statement we have italicized. 
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Ss meeting in small groups, were given pen- 
cil and paper and the following instructions: 
Draw whatever you think most adequately rep- 


resents your behavior immediately before, during, 
and after a moment of stuttering.® 


Questions were met with repetition of the 
intsructions and telling Ss to “do the best 
you can.” Representations were analyzed, 
not in terms of clinical interpretative cate- 
gories, but in terms of content and spon- 
taneous written language used by Ss.‘ 


REsULTs 


The results may best be approached by con- 
sidering illustrative examples of the drawings 


——— Fours whl — emehewAL bu PugGAee 
wownnns Bocanm "8 os my VOCE 


A 


Fic. 1. 


Note too in Figure 1B the symbolization for 
the pre-, during-, and after-stuttering stages. 
These are correlated with the course of the 
graph itself and reflect a process which 
terminates with feelings of relaxation and 
pleasantness as shown by the smiling faces 
in the after-stuttering period.° 

Figure 2 shows three examples of the use 
of changes in facial and related features to 
describe the events surrounding and includ- 
ing the moment of stuttering. Although the 
drawings are quite clear a few points are 
worth special mention. It is interesting to 
observe the changes in the mouth regions in 
Figures 2A and 2B. In the former the mouth 
is shown as open in the pre- and after- 








SruTrerers’ REPRESENTATIONS ILLUsTRATING Use oF GraPH AND Mopiriep GrapH CONTENT TO 
Descrise THE Cycte oF Events RELATED To THE STUTTERING Moment. A was made by a male, 24 


years; B is the product of an adult female. 


provided by Ss. Nine of the 33 drawings are 
reproduced in Figures 1 through 4. Three 
figures depict two drawings each; Figure 2 
presents the work of three individuals. 

Content Analysis. Figure 1 illustrates the 
manner in which graphs were employed by 
some Ss. The left half of this figure gives 
an example of a relatively sophisticated use 
of this type of representation; the right half 
represents a graph modified with cartoonlike 
features. Note the similarity in the course of 
the processes represented in the two drawings 
in this figure. There is a gradual rise in the 
pre-period to some maximum followed by a 
return to what might be termed the baseline. 

3 This procedure was suggested by Sary K. Wischner. 

* Acknowledgment is made to Joan Bopp who assisted 
with the analysis of data. 


periods; in the latter it is shown as closed in 
these same parts of the cycle but is open in 
the during-period. Note also the beads of 
perspiration in this latter drawing. Both 
Figures 2B and 2C also make use of “hair 
standing on end” to emphasize the emotional 
involvement of the stutterer. Of special 
interest is the after-period in Figure 2C 
showing the stick figure lying on a cot or 


5 The most dramatic use of a modified graph to depict 
the stuttering experience, which was difficult to reproduce 
here, was drawn in red pencil or crayon by a college 
stutterer. At the peak of the graph was written the 
word “block.” Above the peak was a fire, and above it, 
in turn, was a human figure lying on what was appar- 
ently a flat board. The words “extremely warm” were 
written between the peak of the graph and the underpart 
of the flame. In the after-period, the graph had returned 
to a baseline and the word “ease” was written. 
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Fic. 2. STuTrrerers’ 


REPRESENTATIONS ILLUSTRATING USE 


oF CHANGES IN FaciaL AND Retatep Features 


To Descrise THE Cycie or Events RELATED To THE STUTTERING MomeENT. A and C were made 
by male college students; B by a 26-year-old male. 


bed, apparently symbolic of a state of ex- 
haustion, relaxation, or energy depletion. 
BY Cet 

Figure 3 provides examples of what we 

have categorized as nature representations. 
8 I 

Figure 3A depicts only the before- and after- 
periods and describes a change from storm 
to sunshine. Figure 3B tells the story of a 


Fic. 3. STUTTERERS’ 


flood resulting from the breaking of a dike 
(during-period) followed by a “calming 
down” (after-period). 

The two drawings in Figure 4 are some- 
what more complex portrayals of the stutter- 
ing experience. Figure 4A generally draws 
a parallel between stuttering and the over- 














REPRESENTATIONS ExeEMPLIFYING Use oF Nature Content To Descrist THE CYCLE OF 


Events RELATED To THE STUTTERING Moment. A is the product of a male college student; B was 
drawn by a male, 14 years. 
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coming of an obstacle, in this particular case, 
a stone wall. Note again the exhausted stick 
figure flat on its back following the “weary 
sound” at the top of the wall. Figure 4B is 
particularly interesting. It pictures a series 
of seated figures which differ from each other 
only in the drawing of the visceral region. 
There is a gradual emptying of this area— 
indicated by the amount of black or white 
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apparent that some form of pure or modified 
graph or facial and related changes were the 
most frequently used content categories. 
Verbal analysis. Twenty-two of the 33 
drawings contained some kind of descriptive 
language. Representative examples of these 
for the during-, before-, and after-periods are 
given in Table 2. In general the language 
used by Ss contained expressions of the fol- 


Before 


Fic. 4. ILLusTRaTIONs oF More Compiex Portrayals BY STUTTERERS TO DescrisE THE CycLe oF EvENTS 


RELATED TO THE STUTTERING MoMENT. 
space—as the stutterer proceeds from the 
beginning to the end of the stuttering cycle. 
The before- and after-figures are identical. 

These nine examples are quite typical of 
the 33 drawings obtained. An analysis of the 
content of all representations yielded the 
breakdown provided in Table 1. It is readily 


TABLE | 


SUMMARY OF CONTENT ANALYSIS OF 
REPRESENTATIONS OF Events RELATING TO 
STUTTERING MoMENT 


STUTTERERS’ 


No. DRawINcs 


ConTENT CATEGOTY 





Faces showing physical and other changes 
& g 
Emphasis on visceral changes 
Stick figures overcoming obstacles 
Representations utilizing nature content 
t 
- or modified graphs 





Both 


A and B were drawn by college students. 


lowing states: physiological or emotional dis- 
turbance, shame or embarrassment, fear, 
tension, exhaustion, relaxation or relief. 
Process analysis. It was possible to abstract 

from the drawings the process or cycle of 
events represented in each. These are sum- 
marized below and show also the number 
of drawings in which each process was 
illustrated : 

tension-release 

normal-climax-normal 

fear-release-fear 

avoidance-escape 

physical-mental tension 


Discussion 
The simple technique employed identifies 
rather clearly, in the present sample of stut- 
terers, a cycle of events surrounding the 
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stuttering moment which appears to support 
the hypothesis under examination. Despite 
variation in manner of representation, both 
content and language demonstrate the opera- 
tion of a strikingly similar process in prac- 
tically all subjects. There is progressively 
mounting tension, anxiety, or stress followed 
by a reduction in these states immediately 
following the completion of the stuttering act. 

The course of this process is remarkably 
similar to the schematic introspectively based 
curves offered by Mowrer (7) and Schilder 
(10) to describe changes in expectancy and 
tension states in a situation where a stimulus, 
such as a light, was followed at a regularly 
recurring interval by electric shock. The 


Georce J. WIscCHNER 


virtue of the fact that he is a stutterer. In 
this context he is not so much concerned with 
stutterings and the events immediately sur- 
rounding them as with the fact that he is a 
stutterer. Finally, in the usual conversa- 
tional situation, or even in the experimental 
situation in which the stutterer might be 
required to read various kinds of materials, 
it is very likely that any anxiety-tension re- 
duction the stutterer experiences immediately 
following the completion of a stuttered word 
is obscured by the reinstatement of the entire 
cycle of events by some new word (danger 
signal). 

In an earlier portion of this paper reference 
was made to the report by stutterers of the 


TABLE 2 


ExaMptes oF LaNGcuAGE EMPLoy! 


PERIODS OF THE 


Berore 


DurRiINnG 


» py Ss in THEIR REPRESENTATIONS OF THE Berore-, DuRiNG-, AND AFTER- 
STUTTERING 


MoMENT 





Breath-chest tight Tense 
Normal-nervous condition arising 
while 
Dike broke. 
ing the land 
Tense 


Water-dike 
Tensing 
Struggle 


. Anticipaticn-conscious preparation 


. Hardly no tension Great tension 


All work up 


. Nothing there 


The water is flood- 


growing greater 
with more stuttering 
Block-excited-trying to force 


Hits the block—gets the word 


More relaxed 


Nervous condition remains for a 


Water has flooded the land and 
calming down (Fig. 3B) 

A great gust of air suddenly being 
ejected; ease (Fig. 2A) 

Weary sound-exhaustion ad infinitum 
(Fig. 4A) 

Tension diminishes although stutter- 
ing may be as great 

Kinda embarrassed—glad it’s over— 
embarrassed I had it 

Look like before—feel embarrassed— 
does not bother anymore—feel like 
did not stutter 





light stimulus in these studies and word 
stimuli for the stutterer acquire the capacity 
to serve as danger signals which instigate the 
entire process. 

One may ask why the anxiety-tension re- 
duction immediately following the moment 
of stuttering, although so clearly evident in 
the data of the present study, is not so readily 
discernible in clinical observation and report. 
There would appear to be at least three rea- 
First, it is evident that for the 
most part both observer and stutterer gen- 
erally have focused attention on the pre- and 
during-stuttering periods which emphasize 
the distress experienced by the stutterer. 
Secondly, the stutterer understandably ¢m- 
phasizes the penalties he experiences by 


sons for this. 


unpleasant consequences they may associate 


with their stuttering behavior. That stut- 
terers experience acute distress during the 
pre- and stuttering-period has been dramati- 
cally attested to in the stutterers’ representa- 
tions of the moment of difficulty. It may be 
that following the stuttering act, in addition 
to the tension-reduction demonstrated, penal- 
izing factors may also be operating. What 
does happen when an act is followed by two 
consequences, reinforcement and _ punish- 
ment? To handle this difficult systematic 
problem both Mowrer and Ullman (g) and 
Dollard and Miller (3) have found it valuable 
to invoke the concepts of gradients of reward 
and gradients of punishment. Mowrer : 

Ullman have provided experimental ey \\'ece 
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which suggests that the order of consequences 
may determine the strengthening or inhibi- 
tion of an act. They write: 

If the punishing consequence comes first and the 
rewarding one later, the difference will be in favor 
of the inhibition. But if the rewarding consequence 


comes first and the punishing one later, the differ- 
ence will be in favor of the reinforcement (9, p. 66). 


It seems that in the case of stuttering the 
more immediate consequence is the reinforce- 
ment associated with the tension-reduction 
following the completion of a stuttered word. 
In any event, notwithstanding the complexi- 
ties involved in understanding the effect of 
diverse consequences on the same behavior 
act, the present results do point to one of the 
possible reinforcements in the case of mal- 
adaptive behavior such as stuttering. 

The findings of the present study stress the 
importance of systematic and persistent attack 
aimed at isolating the possible reinforcements 
in maladaptive behavior before one may con- 
clude that no such mechanisms are operating. 
They also support our belief that stuttering 
behavior, by virtue of its specificity and avail- 
ability, and its lawful variation as a function 
of the manipulation of certain antecedent 
conditions, affords a unique opportunity for 
the study of anxiety variables and the mecha- 
nisms reinforcing anxiety-motivated behavior. 

In the present paper we have proceeded 
essentially on a qualitative level. What is 
needed is experimentation aimed at determin- 
ing the precise nature of the relationships 
between the variables which instigate the 
process illustrated in the stutterers’ repre- 
sentations and the behavior mediated by this 
process.® 


SUMMARY 


The present paper provides evidence con- 
cerning one hypothesis regarding the pos- 
sible reinforcing mechanisms underlying the 
perpetuation of maladaptive behavior such 


as stuttering. The hypothesis assumes that 
a feared word arouses a state of expectancy 
(anxiety) and that completion of the stut- 
tered act is accompanied by reduction of the 
anxiety-tension evoked by the stimulus word 

® For studies concerned with such relationships see 
Goss (4,5). A related type of study by Baron (1) has 


investigated certain functional properties of stuttering 
anxiety in an eyelid conditioning setup. 


(danger signal) with consequent reinforce- 
ment of the stuttering behavior. 

Data were obtained from 33 stutterers who 
were given the following simple instructions: 
“Draw whatever you think most adequately 
represents your behavior immediately before, 
during, and after a moment of stuttering.” 

Content and language illustrated 2 remark- 
ably similar series of events in all Ss. There 
was progressively mounting tension, anxiety 
or stress, followed by a reduction in this 
state upon completion of the stuttering act. 
Content illustrating this included pure or 
modified graphs, figures showing facial and 
visceral changes, stick men overcoming 
obstacles, and nature representations (storm- 
sunshine, flood-calming down). Language 
describing the before- and during-stages in- 
cluded “tense,” “struggle,” “hair on end,” 
“dike breaking,” “extremely warm.” Words 
for the after-period included “exhaustion,” 
“relaxed,” “relieved,” “tension diminishes.” 

The simple technique reveals a cycle of 
events surrounding the moment of stuttering 


which is in line with the above hypothesis. 
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WORD USAGE AND SYNONYM REPRESENTATION IN 
THE ENGLISH LANGUAGE 
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HE study reported herein was designed 

and conducted as a follow-up related 

to a previous experiment executed by 
the senior author (1). Results of the former 
study appeared to show that the variety of 
the individual’s synonym vocabulary for a 
concept varied directly with the frequency 
with which the individual used the concept. 
This finding was interpreted as confirming 
the hypothesis that the principle of reactive 
inhibition operates in the generation and use 
of synonyms. The present study was de- 
signed to test the same hypothesis and to seek 
similar phenomena in the more general use 


of language. The precise form of the hy- 


pothesis is quoted from the study cited above. 


when any reaction occurs, its reaction threshold 
is raised and the probability of its later occurrence 
is reduced. It would supposedly follow that the 
probability of later occurrence would be reduced 
progressively with repetition and the 
rising of the reaction threshold. It is proposed that 
a particular response threshold would 


continued 


this rise in 
leave the reaction thresholds for other reactions, in 
the repertoire, relatively low and would enhance 
the probability of the occurrence of these reactions. 
An extension of this line of thought is here pro- 
posed; that repetitive reacting may ultimately raise 
the reaction threshold to the point whereat novel 
reactions, serving the same or similar ends, will 
appear and enrich the variety of the behavior 
repertoire. 

Now, as a special case, involving the principles 
suggested above, it is proposed that when a concept 
is used repetitiously by a person, or the persons of 
a culture, the synonym vocabulary representing the 
concept will be enriched, and that this enrichment 
will be proportional to the repetitiousness or fre 
quency of use. 


PROCEDURE 


Ten samples of words were taken from Thorn 
dike Lorge, The Teacher's Word Book of 
0,000 Each sample consisted of un 


appearing upon 
] 


and 
Wor ds (2). 


words 


capitalized two consecutive 
tenth pages, for which entries also could be found 
in Webster's Dictionary For ex 
amples, Sample 1 was taken from pages 10 and 20; 

and so on through 


of Synonyms (3). 


Sample > trom I wes 20 and 40; 
samples were further delimited by 


page 20¢ The 
including only those words for which specifi 


numerical frequencies were given. Thus the sam- 
ples were confined to those words with usage fre- 
quencies from 1 to 49 per million. Each sample 
was first treated independently as follows: For each 
word in the sample, the number of synonyms ap- 
pearing in the synonym dictionary was obtained. 
In addition, for each word, the combined number of 
synonyms and analogous words was taken from the 
same source (3). With regard to this latter index, 
a word appearing as both a synonym and an analo- 
gous word was counted only once. 

With the use of the three variables described 
above, two product-moment coefficients were com- 
puted for each sample; one between the usage- 
frequency count and synonym count, and the other 
between usage-frequency count and the combined 
synonym, analogous word count. 


RESULTS 


Table 1 contains the 20 product-moment co- 
efficients yielded by the procedures described. 

Of the 20 coefficients shown in Table 1, 
16 are of the expected sign. The largest of 
these has a value of .39. Of the 4 coefficients 
of contrary sign, the largest is —.16. 

Summary estimates of the significance of 
these apparently positive findings were 
arrived at by transforming each of the arrays 
of coefficients to Fisher’s Z’s; computing the 
means of the arrays; and testing the signifi- 
cance of these means as differing from zero. 
Table 2 shows the summary values. 

The values in the last column of Table 2 
appear to permit interpretation in favor of 


1 It is appropriate to point out that this restriction of 
range would tend to militate against the predicted 
phenomena. 

2 The justification for this combination, and for the 
combined treatment described hereinafter, was based 
upon statements of practice contained in the introduction 
of the synonym dictionary. The relevant paragraph is 
quoted below. 

“Some of the analogous words, 
merit the name of ‘near synonyms’ 
some contain much the 
word, but the implica- 
that expressed 


ysely related 


1 
so close 


or terms cl 
in meaning, 
are they to the 
same implications as the 
tion that they emphasize is not the same as 
in the common denotation of the discriminated group of 
part. Some are more 
word, some more 


vocabulary entry: 
entry 


which the entry word forms a 
general than the entry 
come together in only a part of their meaning 

some important particulars they are all like the 


under which they are listed.” 


some 
But in 
word 


specific; 
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the hypothesis under scrutiny. The evidence 
is especially clear if it is permissible to con- 
sider the dictionary’s distinction between 
synonyms and analogous words as irrelevant 
for present purposes. These findings appear 
to support and supplement the previous 
finding that synonym representation covaries 
with the frequency of word use. It is sug- 
gested that these results are in harmony with 
the principle of reactive inhibition as formu- 
lated in the introduction. 


TABLE 2 


MEAN Z's THEIR SIGNIFICANCE DIFFERING 


FROM ZERO 


AND AS 


M 
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r frequency - Number 
of synonyms 

r frequency - Number 
of Synonyms-+Num- 
ber of Analogous 
Words 





SUMMARY 


Ten word samples were taken from the 
Thorndike and Lorge Teacher's Word-Book 
of 30000 Words. For each word in each 
sample, the number of synonyms and analo- 
gous words given in Webster's Dictionary of 
Synonyms was obtained. Measures of co- 
variance between frequency of use, as given 
in the Word-Book, and number of syno- 
nyms and analogous words, as listed in the 
Dictionary of Synonyms, appeared to yield 
evidence in support of the hypothesis that the 
principle of reactive inhibition, or some simi- 
lar principle, operates in the development of 
language. 
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CRITIQUES AND NOTES 
A CAUTION RESPECTING THE MEASUREMENT OF EMPATHIC ABILITY 


A. H. HASTORF anp I. E. BENDER 
Dartmouth College 


ECENTLY a renewed interest has been shown 
R in attempting to understand the problem of 
our perception of other people. An aspect of this 
general problem that certain fesearches have in- 
vestigated concerns the perception of the thoughts 
and feelings of other people, a process that has 
commonly been termed empathy (1, 2, 3, 4, 5, 6). 
It is obvious that the perception of persons lies at 
the very heart of social psychology and that an 
understanding of empathic ability will contribate 
greatly to our -understanding of many problems 
in both social psychological and personality theory. 
Attempts have already been made to study per- 
sonality characteristics of successful and unsuccess- 
ful empathizers (3, 5). The purpose of this paper 
is to raise some questions relative to the measure- 
ment of empathic ability. 

The term empathy has often been used in the 


sense of projecting one’s self into a work of art, 


or a condition of nature such as a storm, or things 
like airplanes or rockets. Warren gives such a 
definition first, but secondly defines empathy as 
“a mental state in which one identifies or feels 
himself in the same state of mind as another per- 
son” (7, p. 92). Among recent researchers, a com- 
monly accepted verbal definition for empathy is 
“the imaginative transposing of one’s self into the 
thinking, acting of another and so 
structuring the world as he does” (4, p. 127). 
Recent research on the measurement of empathy 
has emphasized the attempt to obtain a quantitative 
The most commonly 
to require .subjects to 


feeling and 


such an ability 


inique has been 
verbal responses of another person on 
a rating scale or a personality test. Dymond (4) 
“cts to predict the self-ratings of other 
and 


index of 
used tecl 


predi t the 


asked subje 


people on six traits, such as self-confidence 

Bender and Hastorf (1) studied 
predictions of responses on such tests as the Allport 
and Allport A-S Reaction Study. Notcutt and 
Silva (6) reported predictions on a rating scale 
of eighteen traits, such as abasement, achievement, 


Empathic ability has been usually 


sense of humor 


and affiliation. 
defined in these studies on the basis of deviational 
score; i.c., the best empathizer is the individual 
whose prediction deviates least from the self-ratings 
of the 

The 
search out the psychological implications of this 


person predicted for. 


purpose of the present study has been to 


technique for operationally defining empathic ability. 


The concern over defining empathy by means 
of a deviational score arises as a result of the 
analysis in a previous study of the role played by pro- 
jection in the task of predicting another person's 
responses (1). It was quite apparent that when 
a person was required to predict the scores of 
two other people on one of the tests, that these 
predictions were much more highly related to each 
other and to the predictor’s own score on that 
test than they were to the scores of the people 
predicted for. This result quite definitely raises 
the question of the role projection plays in this 
task. Projection, as used here, is the attribution 
to others of one’s own needs, interests, and attitudes. 

In this study an attempt was made to isolate 
the factor of projection from what is assumed to 
be empathy. Fifty undergraduates in Dartmouth 
College took the Allport-Vernon Study of Values 
and each was asked to choose another under- 
graduate whom he knew well and to predict the 
responses of this associate on the same test. Two 
deviational scores were obtained. The first score 
may be presumed to be the empathy score and 
is a quantitative measure of the deviation of the 
prediction from the actual responses of the associate. 
The second score may be called the projection 
score, and is a measure of the deviation of the 
prediction from the predictor’s own test score. 
It should be kept in mind that if the predictor and 
his associate are highly similar in their responses 
on the Allport-Vernon test that both the empathy 
and the projection deviations might be very low. 
It should be noted as well that these deviational 
scores were not obtained by scoring the Allport- 
Vernon test in the usual sense but by taking a 
The manner 
in which the tests were scored consisted of finding 
the amount of difference in each item between 
the prediction and the associate’s answer; the total 
of these differences provided a raw empathy score. 
Likewise, the difference for each item was found 
between the predictor’s own answers and his pre- 
dictions for his associate; the total of these devia- 
tions constituted the projection score. Since there 
were 30 items in part 1 of A Study of Values with 
possible deviations of six points on each item, 
the deviation score could theoretically be as high 
as 180. In part 2, which consisted of rank ordering 
alternatives, the greatest theoretical deviation could 
be 120 since there are 4 alternatives for 15 items. 


measure of deviation item by item. 
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A Caution Respecrinc THE MEASUREMENT OF EMPATHIC ABILITY 


For example, if in the first item on the test which 
states: 


The main object of scientific research should be the 
discovery of pure truth rather than its practical applica- 
tions; (a) Yes; (b) No, 


the prediction was the (a) alternative with a yalue 
of three, and (b) with a value of zero, and if the 
score of the associate was actually in the reverse 
direction, a value of zero for (a) and a value of 
three for (b), the amount of deviation for the 
empathy score would be six because the predictor 
would deviate three points on each of the two 
alternatives. With a total possible deviation of 
300, the expected deviation would be 150 if only 
chance were operating. The range of the devia- 
tional scores for empathy was 72 to 162, with a 
mean of 104, while the range of the deviational 
scores for projection was 46 to 146, with a mean 
of 96. Although no significant difference between 
the means was found, the predictions tend to 
deviate less from the predictors’ own scores than 
from their associates’ scores. 

The data were then analyzed from the standpoint 
of the following question: In attempting to predict 
the responses of another person, well known to 
the predictor, was the prediction closer to the re- 
sponses of the person predicted for (empathy) or 
was it closer to the predictor’s own score’ (pro 
jection)? This question can be answered simply 
by subtracting the empathy score from the pro- 
jection score, keeping in mind that the smaller 
the deviation the better the prediction. Thus if a 
person’s prediction is closer to the score of the 
associate predicted for than to his own score, 
the result will be a positive score. If he is closer 
to his own score, the result will be negative. This 
method of subtracting the empathy score from the 
projection score yields the following findings: 

1. Of the 50 predictors, 20 are closer to their 
and thus called 
28 are closer to their own score 
The remain- 


associates’ score may be “em- 
pathizers,” while 
and thus may be called “projectors.” 
ing two had equal projection and empathy deviati- in 
scores. Thus, the phenomenon of projection was 
more prevalent than was that of empathy. 

2. A study of the two extreme groups, the ten 
most empathic and the ten most projective, indi- 
cates that the mean subtraction score of the em- 
pathizers is plus 31 and the mean subtraction score 
of the projectors is minus 44. Disregarding signs, 
these means are found to be different at the .o2 
level of confidence. On the basis of this finding 
the phenomenon of projection was not only more 
frequent (see 1. above), but also more intense 
than was that of empathy. 

3. A comparison of the predictors’ own answers 
with those of their associates brings forth the 


fact that the ten “projectors” are significantly 
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(.o2 level) more similar to their associates than 
are the ten “empathizers.” 

4. The most pertinent finding is concerned with 
the relationship between two operational techniques 
for defining empathic ability. The first may be 
called the raw deviational score which essentially 
answers the question of who is able to predict 
most accurately the responses of his associate, re- 
gardless of other factors. This score has most 
commonly been used to define empathic ability in 
recent studies. The second, which we shall call 
the refined empathy score, answers the question: 
Is the prediction score closer to the associate’s 
seore or the predictor’s own score? The rank-order 
correlation between these two sets of raw and 
refined scores was .30, which is significant at 
the .o5 level of confidence. Although there is 
some degree of commonality between the two 
measures, such a low relationship indicates that 
the two scoring methods are not really measuring 
the same ability. Is it not reasonable to assume 
that the refined empathy score, by controlling 
for the factor of projection, more closely approxi- 
mates a measure of empathic ability? Let us take 
a concrete exampie to illustrate what we regard 
as the advantage of the refined empathy score. 
A student deviated only 72 points in his prediction 
of an associate’s score. This places him at the 
top of the list of the raw scores in empathy. How- 
ever, this student's projection score was 70, show- 
ing that he was actually closer to his own answers 
than he was to his associate’s answers. On the 
other hand, another student’s prediction deviated 
108 from his associate’s answers, which places him 
low in empathic ability as defined by raw devia- 
tional score. However, his projection score was 
140, which shows that his prediction was quite 
definitely closer to his associate’s answers than 
his own answers. The question is: Which of 
these students is the better empathizer? Although 
the raw empathy score appears less impressive in 
the latter case, it actually may indicate more em- 
pathic ability; the latter student may apprehend 
more directly the state of mind of his associate. 
With the above example in mind, it would seem 
necessary to employ some procedure of correcting 
for the influence of projection in determining the 
empathy score. It may be overly severe to grade 
empathic ability by means of subtracting the pro- 
jection score, since it would be dangerous to label 
as projection all the overlap between the prediction 
and the predictor’s own answers. For example, 
the deviant “projectors” were found to be more 
similar to their associates than the deviant “em- 
pathizers” and high similarity combined with real 
empathy would create a spuriously high projection 
score. However, without some correction for pro- 
jection, attempts to measure empathy do not 
seem to make psychological sense. 
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The results of this study emphasize the fact that 
successful prediction of another per- 
rather 


part of the 


son's responses may be due to projection 


than empathy and that a refined measure of em- 
pathic ability will approximate more adequately 
the psychological aspects of empathy when it is 
oneself into the thinking, 


structuring 


defined as “transposing 


another and so 


Perhaps we are not yet 


feeling, and acting of 


the world as he doe 
king definitive theoretical distinctions, 


them 


ready for 


but contemporary theory certainly assumes 


Projection is more 
that the 


to be different processes 


and personal than empathy in 


attri es his 


auntist 


projector own feelings to his asso 


ility seems more objective, 


re truly perceptive of the 


ciates Empathic al 


more cognitive, and m 


psychological structure of the other person. It 


ymbination of sensory, imaginative, 
factors, par 


scems to eac 


and intellectual processes Imitative 
ticularly of a_ kinesthetic 
the process We 
for differentiating between projection and 


The next step 


well aid 
meth- 


nature, may 


suggest the need for a 
odology 
empathy in these prediction tasks. 
would seem to be to obtain predictions from. indi- 


viduals for a number of their associates; these asso- 


ciates should differ in the amount of their similarity 


A. H. Hasrorr anp I. E. Benper 


with the predictor. These data could be used to 
determine more clearly the relationship between 
similarity, projection, and empathy. Furthermore, 
when a person has made a number of predictions, 
analysis could be made of the consistency of his 
projection and empathy scores. 
° 
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THE RELIABILITY OF PSYCHIATRIC DIAGNOSES * 


BENJAMIN MEHLMAN 
Toledo State Hospital, Toledo, Ohio 


CLASSIFICATORY system which has little relia- 
A bility has little practical utility. This is true 
tor the determination of the therapy which is 
said to depend upon the diagnosis and for the de- 
termination of groups for research pur- 
poses. The present nosological system, voted into 
currency by the American Psychiatric Association, 
has had few relatively rigorous tests of its relia- 
bility. 

Elkins (5) found little consensus in the judgments 
and interpretations of 39 judges when presented 
with the identical data of a single case. There 
was even disagreement as to whether the case 
history was of a maladjusted or an adjusted indi- 
vidual, of an opportunistic or a loyal person, of a 
passive or a self-sufficient individual. Ali the judges 
agreed that the amount of information presented 
was inadequate, although the kind of further data 
requested appeared to vary with the professional 
group represented. Ash(1) found disagreements 
among prominent members of the same professional 
group, even when face-to-face interviews were held 
There was agreement on less than 50 per 
cent of the 52 cases seen jointly, even when con- 


criterion 


jointly. 


currence on only the major diagostic categories 


was noted. Doering (4), who had three psychia- 
trists work on the same materials, found important 
discrepancies in their reports. Boison (2) reports 
finding dissimilar percentages of cases in diagnostic 
categories when admission statistics for different 
states were compared. 

These studies would seem to indicate that there is 
a problem of reliability in regard to diagnostic pro- 
Cartwright and French (3) suggest that 
the validity of life-history material is greater than 
its reliability. Even if this is true (one might ques- 
tion their use of the subject's statements about him- 


self as the criterion of validity, see Forer [6]), the 


cedures. 


woblem of reliability remains. 
i 
The present study is concerned with the reliability 


of psychiatric diagnoses, not their validity. It may 


be noted, however, ( 
used are not necessarily mutually exclusive (there is 
no reason why a schizophenic individual could not 
alcoholic); (6) that some of the cate- 


impersonal” illnesses (e.g., 


1) that the diagnostic categories 


also be an 


gories 
sclerosis) while others refer to “personal” illnesses 


refer to arterio- 


e.g., schizophrenia); (c) that some categories are 
t 


yredominantly symptomatological (e.g., alcoholism) 
g 


ychology Department 
ratefully acknowl 


my wife and the P 
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while others are more dynamically oriented (e.g., 
psychoneuroses); and (d) that the homogeneity of 
symptoms or dynamics within any single diagnostic 
group or subgroup has never been adequately dem- 
onstrated (e.g., two patients may conceivably exhibit 
different symptomatology—show customary but dif- 
ferent indicators—and apparently different etiol- 
ogies and still be classified catatonic). 

The specific hypothesis tested in this study was: 
Given a large and nonselective sampling of admis- 
sions to a state hospital, the relative frequencies of 
patients in different diagnostic categories will not 
differ from one admitting psychiatrist to another. 


METHOD 


At the Toledo State Hospital patients admitted to the 
male and female services are assigned to the psychiatrists 
according to their order of admission to the hospital. 
If there are four psychiatrists on the female service, 
then fourth patient, in order of admission, is 
assigned to the same psychiatrist. After the psychiatrist 
collects what he regards as sufficient information, he 
arrives at a diagnosis which is copied on a central locator 
card. In the current study the diagnoses made by 
each psychiatrist were tabulated by diagnostic category 
and by psychiatrist. Diagnostic data from the male 
arid female services were not pooled because of the 
different empirical expectancies for the different sexes 
and because psychiatrists did not have equal opportuni- 
ties to be stationed on each of the services. 


every 


RESULTS 


The chi-square technique was used in all results 
reported below and in Table 1. On the female service 
the null hypothesis, that there is no significant dif- 
ference in the relative frequencies of Organic * and 
Psychogenic 3 patients (N=1358) of 16 psychiatrists, 
could be rejected at below the .oo1 level of confi- 
dence. Similarly, there were significantly greater 
than chance differences (P<.o1) in the relative fre- 


category included the following ill 
psychoses syphilitic meningo-encephalitis, 
psychoses with other forms of syphilis of the central 
nervous system, psychoses with epidemic encephalitis, 
psychoses due to alcohol, psychoses due to a drug or 
other exogenous poison, psychoses due to trauma, psy- 
choses with with 
other disturbances of circulation, psychoses due to con- 
vulsive disorder, senile psychoses, involutional psychoses, 


2 The Organic 


nesses: with 


cerebral arteriosclerosis, psychoses 


psychoses due to other metabolic, etc., diseases, psychoses 
due to growth, and psychoses due to unknown 
or hereditary cause but associated with organic change. 
included the following 


manic-depressive psychoses, 


new 
The Psychogenic category 
illnesses psychoneuroses, 
conditions, 

psychoses 


paranoid 
ality 


paranoia and 
ychopathic pers 


dementia praccox, 
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TABLE | 


Cui-Square REsuLts 








LEVEL oF 
SIGNIFI- 
CANCE 


NUMBER OF 
PsyYcHIa- 
TRISTS 


NUMBER OF 
PATIENTS 


D1iacnostic 
GROUPING 





Organic and 
Psychogenic 
Manic-Depressive 
and Schizophrenic 
Organic and 
Psychogenic 
Manic-Depressive 
and Schizophrenic 


>0.001 
>0o.01 


1358° 


>0.001 


>0.01 





* Female patients. 
** Male patients. 


quencies of patients in the Manic-Depressive and 
Schizophrenic categories (N=799). 

On the male service, where data for nine psy- 
chiatrists were utilized, the null hypothesis of no 
significant difference in the relative frequency of 
patients in the Organic and Psychogenic categories 
(N=1282) could be rejected at below the .oor level 
of confidence. For the relative frequencies of 
patients in the Manic-Depressive and Schizophrenic 
categories (N=597), the null hypothesis could also 
be rejected (P<.or). 


Discussion 

Since there is no reason to believe that the 
medical personnel of Toledo State Hospital are 
select in regard to medical training, diagnostic 
ability, or any other essendal and relevant attribute 
from the personnel of other state hospitals, these 
results would appear to represent an addition to 
the body of evidence indicating the inadequacy 
of current nosological practices. Undoubtedly there 
are individual differences in the ability of psy- 
chiatrists to make diagnostic appraisals. However, 
if significant disagreements are discovered in a 
presumably representative sampling of professional 
personnel doing the bulk of diagnostic work, such 
disagreements suggest that the nosological system 
employed should be examined for revision. Re 
search and clinical practices dependent upon no- 
sology can be only as reliable as that nosology. 

That results such as those reported in this study 
have serious implications for clinical research and 
clinical procedures is noted frequently in the cur- 
rent psychological literature. Roe (7), speaking 
generally about current diagnostic procedures, 
comments: 


I suggest that much of this research (the use of 
various psychological devices for making psychiatric 


diagnoses) is not only a waste of time, and a perpetua- 
tion of errors, but is actually preventing advance in 
the ficld. There are many reasons why this is true, 
but one of the most potent is that it involves clinging 
to a classification which has long since been outlived. 
I submit that using techniques which are not too 
precisely vaiidated at all, to place patients in psychiatric 
categories, the inadequacy ot which is admitted by all 
concerned, is a treadmill procedure guaranteed to keep 
us moving in circles. 


The function of classification is pragmatic. Given 
the protean manifestations of normal and dis- 
ordered personalities, psychiatrists have been forced, 
under the pressure of immediate necessity, to 
categorize. That these categories, devised in ur- 
gency, may require some revision in perspective 
is the thesis of this paper. 


SUMMARY 


On the assumption that a classificatory system 
which has low reliability will have little utility, 
the reliability of psychiatric classification has been 
examined statistically. In a hospital where patients 
are assigned in an unbiased fashion for diagnosis to 
the psychiatrists on a service, the frequencies with 
which the various diagnostic entities were used by 
the various psychiatrists were examined to test the 
null hypothesis: if these categories are used reliably, 
the variations in frequency should fall within the 
limits dictated by random sampling. This null 
hypothesis can be rejected with near certainty. 
This finding suggests, as have others, that the 
existing system of psychiatric classification can 
probably have little value for the administrative 
management of patients or for research, and it 
should be revised. 
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